Optimal Bipartite Network Clustering

03/15/2018
by   Zhixin Zhou, et al.

We consider the problem of bipartite community detection in networks, or more generally the network biclustering problem. We present a fast two-stage procedure based on spectral initialization followed by two applications of a pseudo-likelihood classifier. Under mild regularity conditions, we establish the weak consistency of the procedure (i.e., the convergence of the misclassification rate to zero) under a general bipartite stochastic block model. We show that the procedure is optimal in the sense that it achieves, up to constants, the convergence rate attainable by a biclustering oracle, adaptively over the whole class. The optimal rate we obtain sharpens some of the existing results and generalizes others to a wide regime of average degree growth. As a special case, we recover the known exact recovery threshold in the log n regime of sparsity. To obtain the general consistency result, as part of the provable version of the algorithm, we introduce a sub-block partitioning scheme that is also computationally attractive, allowing for distributed implementation of the algorithm without sacrificing optimality. The provable version of the algorithm is derived from a general blueprint for pseudo-likelihood biclustering algorithms that employ simple EM-type updates. We show the effectiveness of this general class by numerical simulations.
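To make the second stage more concrete, here is a minimal sketch of one pseudo-likelihood-style relabeling pass over the rows of a bipartite adjacency matrix. It is an illustration under stated assumptions, not the authors' algorithm: the function name `pl_update_rows`, the Poisson-type score, the clipping constant `eps`, and the toy matrix are all hypothetical, class priors are ignored, and the initial labels simply stand in for the output of a spectral initialization. The sub-block partitioning scheme is not shown.

```python
import math


def pl_update_rows(A, row_labels, col_labels, K, L, eps=1e-6):
    """One pseudo-likelihood-style pass that relabels the rows of A.

    A is an n_rows x n_cols 0/1 matrix (list of lists); row_labels and
    col_labels are the current cluster labels in range(K) and range(L).
    Assumption: a Poisson-type score is used and class priors are ignored.
    """
    n_rows, n_cols = len(A), len(A[0])

    # Column-cluster sizes m[l].
    m = [0] * L
    for l in col_labels:
        m[l] += 1

    # Block counts b[i][l]: edges from row i into column cluster l.
    b = [[0] * L for _ in range(n_rows)]
    for i in range(n_rows):
        for j in range(n_cols):
            b[i][col_labels[j]] += A[i][j]

    # Plug-in estimate of the block connection rates from current labels.
    n = [0] * K
    edges = [[0] * L for _ in range(K)]
    for i, k in enumerate(row_labels):
        n[k] += 1
        for l in range(L):
            edges[k][l] += b[i][l]
    # Clip at eps so that log() is defined for empty blocks.
    P = [[max(edges[k][l] / max(n[k] * m[l], 1), eps) for l in range(L)]
         for k in range(K)]

    # Assign each row to the cluster with the largest Poisson-type score.
    def score(i, k):
        return sum(b[i][l] * math.log(P[k][l]) - m[l] * P[k][l]
                   for l in range(L))

    return [max(range(K), key=lambda k: score(i, k)) for i in range(n_rows)]


# Toy example (hypothetical data): two row clusters and two column clusters.
A = [
    [1, 1, 1, 0, 0, 0],
    [1, 1, 0, 0, 1, 0],
    [1, 1, 1, 0, 0, 1],
    [0, 0, 0, 1, 1, 1],
    [0, 0, 1, 1, 1, 0],
    [0, 0, 0, 1, 1, 1],
]
init_rows = [0, 0, 1, 1, 1, 1]  # row 2 starts in the wrong cluster
cols = [0, 0, 0, 1, 1, 1]
new_rows = pl_update_rows(A, init_rows, cols, K=2, L=2)
print(new_rows)
```

In this toy case the single pass moves row 2 back into the first cluster. A full procedure in the spirit of the abstract would alternate such passes between rows and columns (the column update is the same function applied to the transpose of `A`), starting from spectral labels.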


