Spectral clustering under degree heterogeneity: a case for the random walk Laplacian

05/03/2021
by   Alexander Modell, et al.
0

This paper shows that graph spectral embedding using the random walk Laplacian produces vector representations which are completely corrected for node degree. Under a generalised random dot product graph, the embedding provides uniformly consistent estimates of degree-corrected latent positions, with asymptotically Gaussian error. In the special case of a degree-corrected stochastic block model, the embedding concentrates about K distinct points, representing communities. These can be recovered perfectly, asymptotically, through a subsequent clustering step, without spherical projection, as commonly required by algorithms based on the adjacency or normalised, symmetric Laplacian matrices. While the estimand does not depend on degree, the asymptotic variance of its estimate does – higher degree nodes are embedded more accurately than lower degree nodes. Our central limit theorem therefore suggests fitting a weighted Gaussian mixture model as the subsequent clustering step, for which we provide an expectation-maximisation algorithm.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/02/2013

Perfect Clustering for Stochastic Blockmodel Graphs via Adjacency Spectral Embedding

Vertex clustering in a stochastic blockmodel graph has wide applicabilit...
research
11/09/2020

Spectral clustering on spherical coordinates under the degree-corrected stochastic blockmodel

Spectral clustering is a popular method for community detection in netwo...
research
03/30/2020

Spectral graph clustering via the Expectation-Solution algorithm

The stochastic blockmodel (SBM) models the connectivity within and betwe...
research
03/11/2013

Spectral Clustering with Epidemic Diffusion

Spectral clustering is widely used to partition graphs into distinct mod...
research
10/12/2019

Spectral clustering in the weighted stochastic block model

This paper is concerned with the statistical analysis of a real-valued s...
research
08/23/2018

On a 'Two Truths' Phenomenon in Spectral Graph Clustering

Clustering is concerned with coherently grouping observations without an...
research
11/02/2015

An Impossibility Result for Reconstruction in a Degree-Corrected Planted-Partition Model

We consider a Degree-Corrected Planted-Partition model: a random graph o...

Please sign up or login with your details

Forgot password? Click here to reset