Spectral clustering in the weighted stochastic block model

10/12/2019
by   Ian Gallagher, et al.
0

This paper is concerned with the statistical analysis of a real-valued symmetric data matrix. We assume a weighted stochastic block model: the matrix indices, taken to represent nodes, can be partitioned into communities so that all entries corresponding to a given community pair are replicates of the same random variable. Extending results previously known only for unweighted graphs, we provide a limit theorem showing that the point cloud obtained from spectrally embedding the data matrix follows a Gaussian mixture model where each community is represented with an elliptical component. We can therefore formally evaluate how well the communities separate under different data transformations, for example, whether it is productive to "take logs". We find that performance is invariant to affine transformation of the entries, but this expected and desirable feature hinges on adaptively selecting the eigenvectors according to eigenvalue magnitude and using Gaussian clustering. We present a network anomaly detection problem with cyber-security data where the matrix of log p-values, as opposed to p-values, has both theoretical and empirical advantages.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/29/2023

Spectral clustering in the Gaussian mixture block model

Gaussian mixture block models are distributions over graphs that strive ...
research
08/29/2020

Exact Recovery of Community Detection in k-Community Gaussian Mixture Model

We study the community detection problem on a Gaussian mixture model, in...
research
11/09/2020

Spectral clustering on spherical coordinates under the degree-corrected stochastic blockmodel

Spectral clustering is a popular method for community detection in netwo...
research
05/03/2021

Spectral clustering under degree heterogeneity: a case for the random walk Laplacian

This paper shows that graph spectral embedding using the random walk Lap...
research
08/09/2019

Extending the Davis-Kahan theorem for comparing eigenvectors of two symmetric matrices II: Computation and Applications

The extended Davis-Kahan theorem makes use of polynomial matrix transfor...
research
02/05/2021

A simpler spectral approach for clustering in directed networks

We study the task of clustering in directed networks. We show that using...
research
03/22/2020

Spectral Clustering Revisited: Information Hidden in the Fiedler Vector

We are interested in the clustering problem on graphs: it is known that ...

Please sign up or login with your details

Forgot password? Click here to reset