Graph powering and spectral robustness

09/13/2018
by   Emmanuel Abbe, et al.

Spectral algorithms, such as principal component analysis and spectral clustering, typically require careful data transformations to be effective: upon observing a matrix A, one may look at the spectrum of ψ(A) for a properly chosen ψ. The issue is that the spectrum of A might be contaminated by non-informational top eigenvalues, e.g., due to scale variations in the data, and the application of ψ aims to remove these. Designing a good functional ψ (and establishing what "good" means) is often challenging and model dependent. This paper proposes a simple and generic construction for sparse graphs, ψ(A) = 1((I+A)^r ≥ 1), where A denotes the adjacency matrix, 1(·) is the entrywise indicator, and r is an integer (less than the graph diameter). This produces a graph connecting vertices from the original graph that are within distance r, and is referred to as graph powering. It is shown that graph powering regularizes the graph and decontaminates its spectrum in the following sense: (i) if the graph is drawn from the sparse Erdős-Rényi ensemble, which has no spectral gap, graph powering produces a "maximal" spectral gap, where maximality is justified by establishing an Alon-Boppana result for powered graphs; (ii) if the graph is drawn from the sparse stochastic block model (SBM), graph powering achieves the fundamental limit for weak recovery (the Kesten-Stigum, or KS, threshold), as in [massoulie-STOC], settling an open problem therein. Further, graph powering is shown to be significantly more robust to tangles and cliques than previous spectral algorithms based on self-avoiding or nonbacktracking walk counts [massoulie-STOC, Mossel_SBM2, bordenave, colin3]. This is illustrated on a geometric block model that is dense in cliques.
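The construction ψ(A) = 1((I+A)^r ≥ 1) can be illustrated with a minimal sketch. The function name `graph_power`, the dense NumPy representation, and the 5-vertex path graph are assumptions made for illustration only; they are not taken from the paper, and a sparse-matrix implementation would be preferable for large sparse graphs.

```python
import numpy as np

def graph_power(adj, r):
    """Return the 0/1 matrix 1((I + A)^r >= 1) for a 0/1 adjacency matrix adj.

    Entry (i, j) is 1 exactly when j is within graph distance r of i.
    Thresholding after each product keeps entries in {0, 1}, which gives
    the same support as (I + A)^r without large intermediate walk counts.
    """
    n = adj.shape[0]
    base = ((np.eye(n, dtype=np.int64) + adj) > 0).astype(np.int64)
    reach = np.eye(n, dtype=np.int64)
    for _ in range(r):
        reach = ((reach @ base) > 0).astype(np.int64)
    return reach

# Hypothetical example: the path graph 0-1-2-3-4.
path = np.zeros((5, 5), dtype=np.int64)
for i in range(4):
    path[i, i + 1] = path[i + 1, i] = 1

powered = graph_power(path, 2)

# Spectrum of the powered graph (symmetric, so eigvalsh applies).
eigenvalues = np.linalg.eigvalsh(powered.astype(float))
```

Here vertices 0 and 2 become adjacent in the powered graph while vertices 0 and 3 (at distance 3) do not. Because the formula uses I+A, the diagonal of the result is all ones; whether to keep or zero out these self-loops before a spectral step is a choice this sketch leaves open.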

research
06/24/2013

Spectral redemption: clustering sparse networks

Spectral algorithms are classic approaches to clustering and community d...
research
10/12/2022

The Power of Two Matrices in Spectral Algorithms

Spectral algorithms are some of the main tools in optimization and infer...
research
06/18/2020

An Alon-Boppana theorem for powered graphs and generalized Ramanujan graphs

The r-th power of a graph modifies a graph by connecting every vertex pa...
research
03/24/2020

A Distributed Algorithm for Spectral Sparsification of Graphs with Applications to Data Clustering

Spectral sparsification is a technique that is used to reduce the number...
research
02/21/2018

Spectrally approximating large graphs with smaller graphs

How does coarsening affect the spectrum of a general graph? We provide c...
research
06/07/2023

A low rank ODE for spectral clustering stability

Spectral clustering is a well-known technique which identifies k cluster...
research
10/05/2020

Multi-scale graph principal component analysis for connectomics

In brain connectomics, the cortical surface is parcellated into differen...
