Hypothesis Testing for Automated Community Detection in Networks

11/12/2013
by   Peter J. Bickel, et al.
0

Community detection in networks is a key exploratory tool with applications in a diverse set of areas, ranging from finding communities in social and biological networks to identifying link farms in the World Wide Web. The problem of finding communities or clusters in a network has received much attention from statistics, physics and computer science. However, most clustering algorithms assume knowledge of the number of clusters k. In this paper we propose to automatically determine k in a graph generated from a Stochastic Blockmodel. Our main contribution is twofold; first, we theoretically establish the limiting distribution of the principal eigenvalue of the suitably centered and scaled adjacency matrix, and use that distribution for our hypothesis test. Secondly, we use this test to design a recursive bipartitioning algorithm. Using quantifiable classification tasks on real world networks with ground truth, we show that our algorithm outperforms existing probabilistic models for learning overlapping clusters, and on unlabeled networks, we show that we uncover nested community structure.

READ FULL TEXT

page 7

page 8

page 19

page 20

research
08/20/2016

The ground truth about metadata and community detection in networks

Across many scientific domains, there is a common need to automatically ...
research
02/24/2017

Hidden Community Detection in Social Networks

We introduce a new paradigm that is important for community detection in...
research
02/28/2018

Evaluating Overfit and Underfit in Models of Network Community Structure

A common data mining task on networks is community detection, which seek...
research
03/06/2023

Well-Connected Communities in Real-World and Synthetic Networks

Integral to the problem of detecting communities through graph clusterin...
research
01/16/2014

Community Detection in Networks using Graph Distance

The study of networks has received increased attention recently not only...
research
11/30/2012

Multislice Modularity Optimization in Community Detection and Image Segmentation

Because networks can be used to represent many complex systems, they hav...
research
08/27/2023

Superpixels algorithms through network community detection

Community detection is a powerful tool from complex networks analysis th...

Please sign up or login with your details

Forgot password? Click here to reset