Weighted Community Detection and Data Clustering Using Message Passing

01/30/2018
by   Cheng Shi, et al.
0

Grouping objects into clusters based on similarities or weights between them is one of the most important problems in science and engineering. In this work, by extending message passing algorithms and spectral algorithms proposed for unweighted community detection problem, we develop a non-parametric method based on statistical physics, by mapping the problem to Potts model at the critical temperature of spin glass transition and applying belief propagation to solve the marginals corresponding to the Boltzmann distribution. Our algorithm is robust to over-fitting and gives a principled way to determine whether there are significant clusters in the data and how many clusters there are. We apply our method to different clustering tasks and use extensive numerical experiments to illustrate the advantage of our method over existing algorithms. In the community detection problem in weighted and directed networks, we show that our algorithm significantly outperforms existing algorithms. In the clustering problem when the data was generated by mixture models in the sparse regime we show that our method works to the theoretical limit of detectability and gives accuracy very close to that of the optimal Bayesian inference. In the semi-supervised clustering problem, our method only needs several labels to work perfectly in classic datasets. Finally, we further develop Thouless-Anderson-Palmer equations which reduce heavily the computation complexity in dense-networks but gives almost the same performance as belief propagation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/24/2019

Accuracy-Memory Tradeoffs and Phase Transitions in Belief Propagation

The analysis of Belief Propagation and other algorithms for the reconst...
research
03/17/2023

Neural-prior stochastic block model

The stochastic block model (SBM) is widely studied as a benchmark for gr...
research
04/30/2014

A Bi-clustering Framework for Consensus Problems

We consider grouping as a general characterization for problems such as ...
research
10/04/2017

Spectral estimation of the percolation transition in clustered networks

There have been several spectral bounds for the percolation transition i...
research
04/20/2020

Flow-based Algorithms for Improving Clusters: A Unifying Framework, Software, and Performance

Clustering points in a vector space or nodes in a graph is a ubiquitous ...
research
07/22/2014

Sequential Changepoint Approach for Online Community Detection

We present new algorithms for detecting the emergence of a community in ...

Please sign up or login with your details

Forgot password? Click here to reset