Covariate-assisted spectral clustering

11/08/2014
by   Norbert Binkiewicz, et al.
0

Biological and social systems consist of myriad interacting units. The interactions can be represented in the form of a graph or network. Measurements of these graphs can reveal the underlying structure of these interactions, which provides insight into the systems that generated the graphs. Moreover, in applications such as connectomics, social networks, and genomics, graph data are accompanied by contextualizing measures on each node. We utilize these node covariates to help uncover latent communities in a graph, using a modification of spectral clustering. Statistical guarantees are provided under a joint mixture model that we call the node-contextualized stochastic blockmodel, including a bound on the mis-clustering rate. The bound is used to derive conditions for achieving perfect clustering. For most simulated cases, covariate-assisted spectral clustering yields results superior to regularized spectral clustering without node covariates and to an adaptation of canonical correlation analysis. We apply our clustering method to large brain graphs derived from diffusion MRI data, using the node locations or neurological region membership as covariates. In both cases, covariate-assisted spectral clustering yields clusters that are easier to interpret neurologically.

READ FULL TEXT
research
02/11/2018

Covariate-assisted Spectral Clustering in Dynamic Networks

In this paper, we study the community detection problem in the dynamic s...
research
05/17/2022

Perfect Spectral Clustering with Discrete Covariates

Among community detection methods, spectral clustering enjoys two desira...
research
06/05/2018

Understanding Regularized Spectral Clustering via Graph Conductance

This paper uses the relationship between graph conductance and spectral ...
research
04/30/2013

Revealing social networks of spammers through spectral clustering

To date, most studies on spam have focused only on the spamming phase of...
research
01/09/2018

Robust Propensity Score Computation Method based on Machine Learning with Label-corrupted Data

In biostatistics, propensity score is a common approach to analyze the i...
research
05/19/2023

Transfer operators on graphs: Spectral clustering and beyond

Graphs and networks play an important role in modeling and analyzing com...
research
07/24/2020

Scaling Graph Clustering with Distributed Sketches

The unsupervised learning of community structure, in particular the part...

Please sign up or login with your details

Forgot password? Click here to reset