Robust Spectral Detection of Global Structures in the Data by Learning a Regularization

09/09/2016
by Pan Zhang, et al.

Spectral methods are popular for detecting global structures in data that can be represented as a matrix. However, when the data matrix is sparse or noisy, classic spectral methods usually fail because the sparsity or noise causes the eigenvectors (or singular vectors) to localize. In this work, we propose a general method that solves the localization problem by learning a regularization matrix from the localized eigenvectors. Using matrix perturbation analysis, we demonstrate that the learned regularizations suppress the eigenvalues associated with localized eigenvectors and enable us to recover the informative eigenvectors that represent the global structure. We show applications of our method to several inference problems: community detection in networks, clustering from pairwise similarities, rank estimation, and matrix completion. Using extensive experiments, we illustrate that our method solves the localization problem and works down to the theoretical detectability limits on different kinds of synthetic data. This is in contrast with existing spectral algorithms based on the data matrix, the non-backtracking matrix, Laplacians, and rank-one regularizations, which perform poorly in the sparse case with noise.
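
The abstract does not spell out the update rule, so the following is only a minimal sketch of the general idea of learning a regularization from localized eigenvectors. It rests on several assumptions that are not stated above: the regularization is taken to be diagonal, localization is measured with the inverse participation ratio, and the names `inverse_participation_ratio` and `learn_regularization`, the threshold, and the step size are all hypothetical choices made for illustration.

```python
import numpy as np


def inverse_participation_ratio(v):
    """Sum of fourth powers of a unit-normalized vector.

    Equals 1/n for a perfectly spread-out vector of length n and 1 for a
    vector concentrated on a single entry.
    """
    v = np.asarray(v, dtype=float)
    v = v / np.linalg.norm(v)
    return float(np.sum(v ** 4))


def learn_regularization(A, num_vectors=2, ipr_threshold=0.1,
                         step=1.0, max_iter=100):
    """Illustrative sketch: learn a diagonal regularization for symmetric A.

    Assumption (not from the abstract): at each iteration the most localized
    of the leading eigenvectors of A + diag(x) is found, and x is decreased
    in proportion to that eigenvector's squared entries, which pushes its
    eigenvalue down.
    """
    A = np.asarray(A, dtype=float)
    x = np.zeros(A.shape[0])  # diagonal of the regularization matrix
    for _ in range(max_iter):
        # eigh returns eigenvalues in ascending order, so the leading
        # eigenvectors are the last columns.
        _, vecs = np.linalg.eigh(A + np.diag(x))
        top = vecs[:, -num_vectors:]
        iprs = [inverse_participation_ratio(top[:, k])
                for k in range(num_vectors)]
        worst = int(np.argmax(iprs))
        if iprs[worst] < ipr_threshold:
            break  # all leading eigenvectors look delocalized
        x -= step * top[:, worst] ** 2
    return x
```

After the loop, the leading eigenvectors of `A + np.diag(x)` would be the candidates for the global structure. For large sparse matrices one would likely swap the dense `np.linalg.eigh` call for a sparse eigensolver; the dense version is used here only to keep the sketch short.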

research
12/03/2015

Bayesian Matrix Completion via Adaptive Relaxed Spectral Regularization

Bayesian matrix completion has been studied based on a low-rank matrix f...
research
06/24/2013

Spectral redemption: clustering sparse networks

Spectral algorithms are classic approaches to clustering and community d...
research
11/03/2016

Spectral community detection in heterogeneous large networks

In this article, we study spectral methods for community detection based...
research
08/02/2016

Global Vertices and the Noising Paradox

A theoretical and experimental analysis related to the identification of...
research
11/03/2010

Performance Analysis of Spectral Clustering on Compressed, Incomplete and Inaccurate Measurements

Spectral clustering is one of the most widely used techniques for extrac...
research
05/31/2014

Optimization via Low-rank Approximation for Community Detection in Networks

Community detection is one of the fundamental problems of network analys...
research
12/15/2020

Spectral Methods for Data Science: A Statistical Perspective

Spectral methods have emerged as a simple yet surprisingly effective app...
