Statistical inference of assortative community structures

06/25/2020
by Lizhi Zhang, et al.

We develop a principled methodology to infer assortative communities in networks based on a nonparametric Bayesian formulation of the planted partition model. We show that this approach succeeds in finding statistically significant assortative modules in networks, unlike alternatives such as modularity maximization, which systematically overfits in both artificial and empirical examples. In addition, we show that our method is not subject to a resolution limit and can uncover an arbitrarily large number of communities, as long as there is statistical evidence for them. Our formulation is amenable to model selection procedures, which allow us to compare it to more general approaches based on the stochastic block model and thereby reveal whether assortativity is in fact the dominant large-scale mixing pattern. We perform this comparison with several empirical networks and identify numerous cases where the network's assortativity is exaggerated by traditional community detection methods, and we show how a more faithful degree of assortativity can be identified.
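The resolution limit mentioned in the abstract can be illustrated with a small, self-contained sketch. It is not the authors' inference method. It only evaluates the standard modularity score on a "ring of cliques", a textbook construction that is assumed here for illustration and is not taken from this page. The function names `ring_of_cliques` and `modularity` are hypothetical helpers written for this example.

```python
from itertools import combinations


def ring_of_cliques(n_cliques, clique_size):
    """Edge list of n_cliques complete graphs joined in a ring by single edges."""
    edges = []
    for c in range(n_cliques):
        first = c * clique_size
        # All pairs inside clique c.
        edges.extend(combinations(range(first, first + clique_size), 2))
        # One bridge from the last node of clique c to the first node of the next.
        nxt = (c + 1) % n_cliques
        edges.append((first + clique_size - 1, nxt * clique_size))
    return edges


def modularity(edges, membership):
    """Modularity Q = sum_r [ e_rr / E - (d_r / 2E)^2 ] for an undirected graph.

    e_rr is the number of edges inside group r, d_r the sum of degrees in
    group r, and E the total number of edges.
    """
    n_edges = len(edges)
    internal, degree = {}, {}
    for u, v in edges:
        ru, rv = membership[u], membership[v]
        degree[ru] = degree.get(ru, 0) + 1
        degree[rv] = degree.get(rv, 0) + 1
        if ru == rv:
            internal[ru] = internal.get(ru, 0) + 1
    return sum(
        internal.get(r, 0) / n_edges - (d / (2 * n_edges)) ** 2
        for r, d in degree.items()
    )


# Illustrative sizes (an assumption of this sketch): 30 cliques of 5 nodes each.
n_cliques, clique_size = 30, 5
edges = ring_of_cliques(n_cliques, clique_size)
n_nodes = n_cliques * clique_size

# Partition 1: every clique is its own group.
single = [v // clique_size for v in range(n_nodes)]
# Partition 2: adjacent cliques are merged in pairs.
paired = [v // (2 * clique_size) for v in range(n_nodes)]

q_single = modularity(edges, single)
q_paired = modularity(edges, paired)
print(f"Q, one group per clique:     {q_single:.4f}")
print(f"Q, adjacent cliques merged:  {q_paired:.4f}")
```

With these sizes the merged partition scores higher than the one that keeps each clique separate, even though the cliques are the obvious modules. This is the kind of behavior the abstract says the proposed approach avoids.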


