Robust and computationally feasible community detection in the presence of arbitrary outlier nodes

04/23/2014
by   T. Tony Cai, et al.
0

Community detection, which aims to cluster N nodes in a given graph into r distinct groups based on the observed undirected edges, is an important problem in network data analysis. In this paper, the popular stochastic block model (SBM) is extended to the generalized stochastic block model (GSBM) that allows for adversarial outlier nodes, which are connected with the other nodes in the graph in an arbitrary way. Under this model, we introduce a procedure using convex optimization followed by k-means algorithm with k=r. Both theoretical and numerical properties of the method are analyzed. A theoretical guarantee is given for the procedure to accurately detect the communities with small misclassification rate under the setting where the number of clusters can grow with N. This theoretical result admits to the best-known result in the literature of computationally feasible community detection in SBM without outliers. Numerical results show that our method is both computationally fast and robust to different kinds of outliers, while some popular computationally fast community detection algorithms, such as spectral clustering applied to adjacency matrices or graph Laplacians, may fail to retrieve the major clusters due to a small portion of outliers. We apply a slight modification of our method to a political blogs data set, showing that our method is competent in practice and comparable to existing computationally feasible methods in the literature. To the best of the authors' knowledge, our result is the first in the literature in terms of clustering communities with fast growing numbers under the GSBM where a portion of arbitrary outlier nodes exist.

READ FULL TEXT

page 21

page 22

page 23

research
06/16/2019

Community Detection Based on the L_∞ convergence of eigenvectors in DCBM

Spectral clustering is one of the most popular algorithms for community ...
research
09/24/2020

Sketch-based community detection in evolving networks

We consider an approach for community detection in time-varying networks...
research
11/29/2019

Link Prediction in the Stochastic Block Model with Outliers

The Stochastic Block Model is a popular model for network analysis in th...
research
09/30/2018

Convex Relaxation Methods for Community Detection

This paper surveys recent theoretical advances in convex optimization ap...
research
07/20/2020

Integrating Network Embedding and Community Outlier Detection via Multiclass Graph Description

Network (or graph) embedding is the task to map the nodes of a graph to ...
research
08/25/2022

Adaptive Weights Community Detection

Due to the technological progress of the last decades, Community Detecti...
research
06/08/2016

On clustering network-valued data

Community detection, which focuses on clustering nodes or detecting comm...

Please sign up or login with your details

Forgot password? Click here to reset