Preconditioned Spectral Clustering for Stochastic Block Partition Streaming Graph Challenge

08/21/2017
by   David Zhuzhunashvili, et al.
0

Locally Optimal Block Preconditioned Conjugate Gradient (LOBPCG) is demonstrated to efficiently solve eigenvalue problems for graph Laplacians that appear in spectral clustering. For static graph partitioning, 10-20 iterations of LOBPCG without preconditioning result in 10x error reduction, enough to achieve 100 partitions, e.g., for graphs with 5K/.1M (50K/1M) Vertices/Edges in 2 (7) seconds, compared to over 5,000 (30,000) seconds needed by the baseline Python code. Our Python code 100 Challenge static graphs with 0.5M (2M) vertices in 270 (1,700) seconds using 10GB (50GB) of memory. Our single-precision MATLAB code calculates the same clusters at half time and memory. For streaming graph partitioning, LOBPCG is initiated with approximate eigenvectors of the graph Laplacian already computed for the previous graph, in many cases reducing 2-3 times the number of required LOBPCG iterations, compared to the static case. Our spectral clustering is generic, i.e. assuming nothing specific of the block model or streaming, used to generate the graphs for the Challenge, in contrast to the base code. Nevertheless, in 10-stage streaming comparison with the base code for the 5K graph, the quality of our clusters is similar or better starting at stage 4 (7) for emerging edging (snowballing) streaming, while the computations are over 100-1000 faster.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/07/2020

Average Sensitivity of Spectral Clustering

Spectral clustering is one of the most popular clustering methods for fi...
research
01/20/2020

2PS: High-Quality Edge Partitioning with Two-Phase Streaming

Graph partitioning is an important preprocessing step to distributed gra...
research
08/25/2017

Streaming Graph Challenge: Stochastic Block Partition

An important objective for analyzing real-world graphs is to achieve sca...
research
07/24/2020

Scaling Graph Clustering with Distributed Sketches

The unsupervised learning of community structure, in particular the part...
research
01/07/2013

Efficient Eigen-updating for Spectral Graph Clustering

Partitioning a graph into groups of vertices such that those within each...
research
05/16/2023

Spectral Clustering via Orthogonalization-Free Methods

Graph Signal Filter used as dimensionality reduction in spectral cluster...
research
03/22/2020

Spectral Clustering Revisited: Information Hidden in the Fiedler Vector

We are interested in the clustering problem on graphs: it is known that ...

Please sign up or login with your details

Forgot password? Click here to reset