Scaling Graph Clustering with Distributed Sketches

07/24/2020
by   Benjamin W. Priest, et al.

The unsupervised learning of community structure, in particular the partitioning of vertices into clusters or communities, is a canonical and well-studied problem in exploratory graph analysis. However, as with most graph analyses, immense scale presents challenges to traditional methods. Spectral clustering in distributed memory, for example, requires hundreds of expensive bulk-synchronous communication rounds to compute an embedding of vertices onto a few eigenvectors of a graph-associated matrix. Furthermore, the whole computation may need to be repeated if even a small percentage of the underlying graph's edges are updated. We present a method inspired by spectral clustering in which we instead use matrix sketches derived from random dimension-reducing projections. We show that our method produces embeddings that yield performant clustering results given a fully-dynamic stochastic block model stream, using both the fast Johnson-Lindenstrauss and CountSketch transforms. We also discuss the effects of stochastic block model parameters on the required dimensionality of the resulting embeddings, and show how random projections could significantly improve the performance of graph clustering in distributed memory.
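To make the idea of a sketch-based embedding concrete, the following is a minimal illustrative sketch in Python, not the authors' implementation. It assumes a small two-block stochastic block model, a dense NumPy adjacency matrix, and a simple CountSketch built from random bucket and sign assignments; the function names (`sample_sbm`, `make_countsketch`, `projection_matrix`, `apply_edge_update`) and all parameter values are hypothetical choices made for illustration. Because the projection is linear, each edge insertion or deletion touches only two sketch entries, and the streamed sketch matches the one computed from the final graph in a single batch.

```python
import numpy as np


def sample_sbm(sizes, p_in, p_out, rng):
    """Sample an undirected stochastic block model adjacency matrix (assumed setup)."""
    labels = np.repeat(np.arange(len(sizes)), sizes)
    same_block = labels[:, None] == labels[None, :]
    probs = np.where(same_block, p_in, p_out)
    upper = np.triu(rng.random(probs.shape) < probs, k=1)
    return (upper | upper.T).astype(float), labels


def make_countsketch(n, d, rng):
    """Assign each of n vertices a random bucket in [0, d) and a random sign."""
    buckets = rng.integers(0, d, size=n)
    signs = rng.choice([-1.0, 1.0], size=n)
    return buckets, signs


def projection_matrix(buckets, signs, d):
    """Dense n-by-d CountSketch matrix: one signed nonzero per row."""
    P = np.zeros((buckets.size, d))
    P[np.arange(buckets.size), buckets] = signs
    return P


def apply_edge_update(sketch, u, v, delta, buckets, signs):
    """Stream one undirected edge insertion (delta=+1) or deletion (delta=-1)."""
    sketch[u, buckets[v]] += delta * signs[v]
    sketch[v, buckets[u]] += delta * signs[u]


rng = np.random.default_rng(0)
A, labels = sample_sbm([100, 100], p_in=0.5, p_out=0.05, rng=rng)
n, d = A.shape[0], 32
buckets, signs = make_countsketch(n, d, rng)

# Build the sketch by streaming edge insertions one at a time.
S = np.zeros((n, d))
for u, v in zip(*np.nonzero(np.triu(A, k=1))):
    apply_edge_update(S, u, v, +1.0, buckets, signs)

# Delete one existing edge: only two sketch entries change.
u0, v0 = np.argwhere(np.triu(A, k=1))[0]
apply_edge_update(S, u0, v0, -1.0, buckets, signs)
A[u0, v0] = A[v0, u0] = 0.0

# Batch sketch of the updated graph, for comparison with the streamed one.
S_batch = A @ projection_matrix(buckets, signs, d)
print(S.shape, np.allclose(S, S_batch))
```

Each row of `S` is a `d`-dimensional embedding of one vertex, which could then be handed to a standard clustering routine such as k-means. The embedding dimension `d = 32` here is arbitrary; how large it needs to be depends on the block model parameters, as the abstract notes.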


