Recovering Unbalanced Communities in the Stochastic Block Model With Application to Clustering with a Faulty Oracle

02/17/2022
by   Chandra Sekhar Mukherjee, et al.
0

The stochastic block model (SBM) is a fundamental model for studying graph clustering or community detection in networks. It has received great attention in the last decade and the balanced case, i.e., assuming all clusters have large size, has been well studied. However, our understanding of SBM with unbalanced communities (arguably, more relevant in practice) is still very limited. In this paper, we provide a simple SVD-based algorithm for recovering the communities in the SBM with communities of varying sizes. Under the KS-threshold conjecture, the tradeoff between the parameters in our algorithm is nearly optimal up to polylogarithmic factors for a wide range of regimes. As a byproduct, we obtain a time-efficient algorithm with improved query complexity for a clustering problem with a faulty oracle, which improves upon a number of previous work (Mazumdarand Saha [NIPS 2017], Larsen, Mitzenmacher and Tsourakakis [WWW 2020], Peng and Zhang[COLT 2021]). Under the KS-threshold conjecture, the query complexity of our algorithm is nearly optimal up to polylogarithmic factors.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/18/2021

Towards a Query-Optimal and Time-Efficient Algorithm for Clustering with a Faulty Oracle

Motivated by applications in crowdsourced entity resolution in database,...
research
02/02/2021

Community Detection with a Subsampled Semidefinite Program

Semidefinite programming is an important tool to tackle several problems...
research
07/03/2015

Estimating the number of communities in networks by spectral methods

Community detection is a fundamental problem in network analysis with ma...
research
04/10/2018

Strong consistency of Krichevsky-Trofimov estimator for the number of communities in the Stochastic Block Model

In this paper we introduce the Krichevsky-Trofimov estimator for the num...
research
03/06/2023

Well-Connected Communities in Real-World and Synthetic Networks

Integral to the problem of detecting communities through graph clusterin...
research
06/18/2023

Instance-Optimal Cluster Recovery in the Labeled Stochastic Block Model

We consider the problem of recovering hidden communities in the Labeled ...
research
06/13/2020

Learning Sparse Graphons and the Generalized Kesten-Stigum Threshold

The problem of learning graphons has attracted considerable attention ac...

Please sign up or login with your details

Forgot password? Click here to reset