Improved clustering algorithms for the Bipartite Stochastic Block Model

11/18/2019
by   Mohamed Ndaoud, et al.
0

We consider a Bipartite Stochastic Block Model (BSBM) on vertex sets V_1 and V_2, and investigate asymptotic sufficient conditions of exact and almost full recovery for polynomial-time algorithms of clustering over V_1, in the regime where the cardinalities satisfy |V_1|≪|V_2|. We improve upon the known conditions of almost full recovery for spectral clustering algorithms in BSBM. Furthermore, we propose a new computationally simple procedure achieving exact recovery under milder conditions than the state of the art. This procedure is a variant of Lloyd's iterations initialized with a well-chosen spectral algorithm leading to what we expect to be optimal conditions for exact recovery in this model. The key elements of the proof techniques are different from classical community detection tools on random graphs. In particular, we develop a heavy-tailed variant of matrix Bernstein inequality. Finally, using the connection between planted satisfiability problems and the BSBM, we improve upon the sufficient number of clauses to completely recover the planted assignment.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/21/2020

Strong Consistency, Graph Laplacians, and the Stochastic Block Model

Spectral clustering has become one of the most popular algorithms in dat...
research
03/15/2018

Optimal Bipartite Network Clustering

We consider the problem of bipartite community detection in networks, or...
research
03/12/2018

Analysis of spectral clustering algorithms for community detection: the general bipartite setting

We consider the analysis of spectral clustering algorithms for community...
research
04/14/2023

Strong Consistency Guarantees for Clustering High-Dimensional Bipartite Graphs with the Spectral Method

In this work, we focus on the Bipartite Stochastic Block Model (BiSBM), ...
research
02/06/2014

Statistical-Computational Tradeoffs in Planted Problems and Submatrix Localization with a Growing Number of Clusters and Submatrices

We consider two closely related problems: planted clustering and submatr...
research
10/22/2018

Recovery, detection and confidence sets of communities in a sparse stochastic block model

Posterior distributions for community assignment in the planted bi-secti...
research
06/28/2020

Community detection and percolation of information in a geometric setting

We make the first steps towards generalizing the theory of stochastic bl...

Please sign up or login with your details

Forgot password? Click here to reset