Log In Sign Up

Randomized Spectral Clustering in Large-Scale Stochastic Block Models

by   Hai Zhang, et al.

Spectral clustering has been one of the widely used methods for community detection in networks. However, large-scale networks bring computational challenge to it. In this paper, we study spectral clustering using randomized sketching algorithms from a statistical perspective, where we typically assume the network data are generated from a stochastic block model. To do this, we first use the recent developed sketching algorithms to derive two randomized spectral clustering algorithms, namely, the random projection-based and the random sampling-based spectral clustering. Then we study the theoretical bounds of the resulting algorithms in terms of the approximation error for the population adjacency matrix, the misclustering error, and the estimation error for the link probability matrix. It turns out that, under mild conditions, the randomized spectral clustering algorithms perform similarly to the original one. We also conduct numerical experiments to support the theoretical findings.


Randomized spectral co-clustering for large-scale directed networks

Directed networks are generally used to represent asymmetric relationshi...

Distributed Community Detection for Large Scale Networks Using Stochastic Block Model

With rapid developments of information and technology, large scale netwo...

Spectral learning of multivariate extremes

We propose a spectral clustering algorithm for analyzing the dependence ...

Sparse Quantized Spectral Clustering

Given a large data matrix, sparsifying, quantizing, and/or performing ot...

A Unified Framework for Tuning Hyperparameters in Clustering Problems

Selecting hyperparameters for unsupervised learning problems is difficul...

Network Summarization with Preserved Spectral Properties

Large-scale networks are widely used to represent object relationships i...

Analysis of spectral clustering algorithms for community detection: the general bipartite setting

We consider the analysis of spectral clustering algorithms for community...