Understanding Regularized Spectral Clustering via Graph Conductance

06/05/2018
by Yilin Zhang, et al.

This paper uses the relationship between graph conductance and spectral clustering to study (i) the failures of spectral clustering and (ii) the benefits of regularization. The explanation is simple. Sparse and stochastic graphs create many small trees that are connected to the core of the graph by only one edge. Graph conductance is sensitive to these noisy "dangling sets", and spectral clustering inherits this sensitivity. The second part of the paper starts from a previously proposed form of regularized spectral clustering and shows that it is related to the graph conductance on a "regularized graph". We call the conductance on the regularized graph CoreCut. Based upon previous arguments that relate graph conductance to spectral clustering (e.g. the Cheeger inequality), minimizing CoreCut relaxes to regularized spectral clustering. Simple inspection of CoreCut reveals why it is less sensitive to small cuts in the graph. Together, these results show that unbalanced partitions from spectral clustering can be understood as overfitting to noise in the periphery of a sparse and stochastic graph. Regularization fixes this overfitting. In addition to this statistical benefit, these results also demonstrate how regularization can improve the computational speed of spectral clustering. We provide simulations and data examples to illustrate these results.
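
The sensitivity of conductance to a dangling set, and the effect of regularization on it, can be illustrated with a small numerical sketch. The abstract does not state which form of regularization is used; the code below assumes one common form, in which tau/N is added to every entry of the adjacency matrix. The toy graph, the function names `conductance` and `regularize`, and the choice tau = 1 are all hypothetical and chosen only for illustration.

```python
import numpy as np


def conductance(A, S):
    """Conductance of node set S in a weighted graph with adjacency matrix A:
    cut(S, S^c) / min(vol(S), vol(S^c)), where vol sums the node degrees."""
    n = A.shape[0]
    in_S = np.zeros(n, dtype=bool)
    in_S[list(S)] = True
    cut = A[in_S][:, ~in_S].sum()   # total edge weight leaving S
    deg = A.sum(axis=1)             # (weighted) node degrees
    return cut / min(deg[in_S].sum(), deg[~in_S].sum())


def regularize(A, tau):
    """Assumed regularization: add tau/N to every entry of A."""
    n = A.shape[0]
    return A + tau / n


# Toy graph: a dense core (complete graph on nodes 0..5) plus a two-node
# path (nodes 6 and 7) hanging off node 0 by a single edge.
n = 8
A = np.zeros((n, n))
A[:6, :6] = 1.0
np.fill_diagonal(A, 0.0)
for i, j in [(0, 6), (6, 7)]:
    A[i, j] = A[j, i] = 1.0

dangling = [6, 7]
phi_plain = conductance(A, dangling)                        # cut 1, volume 3
phi_reg = conductance(regularize(A, tau=1.0), dangling)     # cut 2.5, volume 5

print(f"conductance of dangling set, original graph:    {phi_plain:.3f}")
print(f"conductance of dangling set, regularized graph: {phi_reg:.3f}")
```

On this toy graph the dangling set has conductance 1/3 on the original graph and 1/2 on the regularized graph, so cutting off the periphery looks less attractive once every node carries a little extra weight. This is only a sketch of the mechanism under the stated assumption about the regularizer, not a reproduction of the paper's experiments.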


research
04/08/2020

Robust spectral clustering using LASSO regularization

Cluster structure detection is a fundamental task for the analysis of gr...
research
12/23/2019

Spectral embedding of regularized block models

Spectral embedding is a popular technique for the representation of grap...
research
12/05/2013

Impact of regularization on Spectral Clustering

The performance of spectral clustering can be considerably improved via ...
research
06/08/2017

Clustering with t-SNE, provably

t-distributed Stochastic Neighborhood Embedding (t-SNE), a clustering an...
research
12/03/2019

Optimal Laplacian regularization for sparse spectral community detection

Regularization of the classical Laplacian matrices was empirically shown...
research
11/08/2014

Covariate-assisted spectral clustering

Biological and social systems consist of myriad interacting units. The i...
research
01/08/2020

A Group Norm Regularized LRR Factorization Model for Spectral Clustering

Spectral clustering is a very important and classic graph clustering met...
