Improved Graph Clustering

10/11/2012
by   Yudong Chen, et al.
0

Graph clustering involves the task of dividing nodes into clusters, so that the edge density is higher within clusters as opposed to across clusters. A natural, classic and popular statistical setting for evaluating solutions to this problem is the stochastic block model, also referred to as the planted partition model. In this paper we present a new algorithm--a convexified version of Maximum Likelihood--for graph clustering. We show that, in the classic stochastic block model setting, it outperforms existing methods by polynomial factors when the cluster size is allowed to have general scalings. In fact, it is within logarithmic factors of known lower bounds for spectral methods, and there is evidence suggesting that no polynomial time algorithm would do significantly better. We then show that this guarantee carries over to a more general extension of the stochastic block model. Our method can handle the settings of semi-random graphs, heterogeneous degree distributions, unequal cluster sizes, unaffiliated nodes, partially observed graphs and planted clique/coloring etc. In particular, our results provide the best exact recovery guarantees to date for the planted partition, planted k-disjoint-cliques and planted noisy coloring models with general cluster sizes; in other settings, we match the best existing results up to logarithmic factors.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/06/2014

Statistical-Computational Tradeoffs in Planted Problems and Submatrix Localization with a Growing Number of Clusters and Submatrices

We consider two closely related problems: planted clustering and submatr...
research
04/25/2011

Clustering Partially Observed Graphs via Convex Optimization

This paper considers the problem of clustering a partially observed unwe...
research
02/13/2018

A Faster Isomorphism Test for Graphs of Small Degree

Luks's algorithm (JCSS 1982) to test isomorphism of bounded degree graph...
research
06/18/2023

Instance-Optimal Cluster Recovery in the Labeled Stochastic Block Model

We consider the problem of recovering hidden communities in the Labeled ...
research
05/09/2016

Exact ICL maximization in a non-stationary temporal extension of the stochastic block model for dynamic networks

The stochastic block model (SBM) is a flexible probabilistic tool that c...
research
03/23/2020

Hypergraph Clustering in the Weighted Stochastic Block Model via Convex Relaxation of Truncated MLE

We study hypergraph clustering under the weighted d-uniform hypergraph s...
research
08/29/2023

Clustering Without an Eigengap

We study graph clustering in the Stochastic Block Model (SBM) in the pre...

Please sign up or login with your details

Forgot password? Click here to reset