Instance-Optimal Cluster Recovery in the Labeled Stochastic Block Model

06/18/2023
by Kaito Ariu, et al.

We consider the problem of recovering hidden communities in the Labeled Stochastic Block Model (LSBM) with a finite number of clusters, where cluster sizes grow linearly with the total number n of items. In the LSBM, a label is (independently) observed for each pair of items. Our objective is to devise an efficient algorithm that recovers clusters using the observed labels. To this end, we revisit instance-specific lower bounds on the expected number of misclassified items satisfied by any clustering algorithm. We present Instance-Adaptive Clustering (IAC), the first algorithm whose performance matches these lower bounds both in expectation and with high probability. IAC consists of a one-time spectral clustering algorithm followed by an iterative likelihood-based cluster assignment improvement. This approach is based on the instance-specific lower bound and does not require any model parameters, including the number of clusters. By performing the spectral clustering only once, IAC maintains an overall computational complexity of 𝒪(n polylog(n)). We illustrate the effectiveness of our approach through numerical experiments.
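To make the model and the improvement step concrete, the following is a minimal sketch rather than the IAC algorithm itself. It samples a small two-cluster LSBM and applies one round of likelihood-based reassignment. Several things here are assumptions made for illustration: the function names `sample_lsbm` and `refine`, the label probabilities (0.8 within clusters, 0.2 across), the number of clusters being given to the code, and the use of a deliberately corrupted copy of the true assignment in place of the spectral initialization.

```python
import numpy as np

def sample_lsbm(sizes, p, rng):
    """Draw a symmetric label matrix: the pair (v, w) receives label l with
    probability p[cluster(v), cluster(w), l], independently across pairs."""
    truth = np.repeat(np.arange(len(sizes)), sizes)
    n, L = truth.size, p.shape[2]
    cdf = np.cumsum(p[truth[:, None], truth[None, :]], axis=2)  # shape (n, n, L)
    draws = (rng.random((n, n, 1)) > cdf).sum(axis=2)           # inverse-CDF sampling
    draws = np.minimum(draws, L - 1)                            # guard against rounding in cdf
    upper = np.triu(draws, k=1)                                 # keep one draw per pair
    return truth, upper + upper.T                               # diagonal is unused

def refine(labels, assign, K, L, eps=1e-6):
    """One likelihood-based reassignment round (illustrative, K assumed known)."""
    n = assign.size
    onehot_c = np.eye(K)[assign]                   # (n, K) cluster membership
    onehot_l = np.eye(L)[labels]                   # (n, n, L) label indicators
    onehot_l[np.arange(n), np.arange(n), :] = 0    # ignore self-pairs
    # counts[v, j, l]: number of items w != v in cluster j with label(v, w) = l
    counts = np.einsum('vwl,wj->vjl', onehot_l, onehot_c)
    # p_hat[k, j, l]: empirical label frequencies between clusters k and j
    totals = np.einsum('vk,vjl->kjl', onehot_c, counts) + eps
    p_hat = totals / totals.sum(axis=2, keepdims=True)
    # loglik[v, k]: log-likelihood of v's observed labels if v belonged to cluster k
    loglik = np.einsum('vjl,kjl->vk', counts, np.log(p_hat))
    return loglik.argmax(axis=1)

rng = np.random.default_rng(0)
K, L = 2, 2                                        # assumed: 2 clusters, binary labels
p = np.empty((K, K, L))
p[..., 1] = np.where(np.eye(K, dtype=bool), 0.8, 0.2)
p[..., 0] = 1 - p[..., 1]

truth, labels = sample_lsbm([100, 100], p, rng)

# Stand-in for the spectral step: corrupt 40 of the 200 true assignments.
start = truth.copy()
flip = rng.choice(truth.size, size=40, replace=False)
start[flip] = 1 - start[flip]

refined = refine(labels, start, K, L)
errors_before = int((start != truth).sum())
errors_after = int((refined != truth).sum())
print(errors_before, errors_after)
```

In this well-separated toy instance, a single reassignment round should remove most or all of the injected errors. Since the sketch starts from a perturbed copy of the truth, cluster indices already align and no label permutation matching is needed when counting misclassified items.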


research
10/20/2015

Optimal Cluster Recovery in the Labeled Stochastic Block Model

We consider the problem of community detection or clustering in the labe...
research
10/14/2019

Optimal Clustering from Noisy Binary Feedback

We study the problem of recovering clusters from binary user feedback. I...
research
11/04/2022

Model-based graph clustering of a collection of networks using an agglomerative algorithm

Graph clustering is the task of partitioning a collection of observed ne...
research
08/08/2015

Minimax Optimal Variable Clustering in G-Block Correlation Models via Cord

The goal of variable clustering is to partition a random vector X∈ R^p ...
research
10/11/2012

Improved Graph Clustering

Graph clustering involves the task of dividing nodes into clusters, so t...
research
12/15/2015

Relative Density and Exact Recovery in Heterogeneous Stochastic Block Models

The Stochastic Block Model (SBM) is a widely used random graph model for...
research
02/17/2022

Recovering Unbalanced Communities in the Stochastic Block Model With Application to Clustering with a Faulty Oracle

The stochastic block model (SBM) is a fundamental model for studying gra...
