Optimality and Sub-optimality of PCA for Spiked Random Matrices and Synchronization

09/19/2016
by   Amelia Perry, et al.
0

A central problem of random matrix theory is to understand the eigenvalues of spiked random matrix models, in which a prominent eigenvector is planted into a random matrix. These distributions form natural statistical models for principal component analysis (PCA) problems throughout the sciences. Baik, Ben Arous and Péché showed that the spiked Wishart ensemble exhibits a sharp phase transition asymptotically: when the signal strength is above a critical threshold, it is possible to detect the presence of a spike based on the top eigenvalue, and below the threshold the top eigenvalue provides no information. Such results form the basis of our understanding of when PCA can detect a low-rank signal in the presence of noise. However, not all the information about the spike is necessarily contained in the spectrum. We study the fundamental limitations of statistical methods, including non-spectral ones. Our results include: I) For the Gaussian Wigner ensemble, we show that PCA achieves the optimal detection threshold for a variety of benign priors for the spike. We extend previous work on the spherically symmetric and i.i.d. Rademacher priors through an elementary, unified analysis. II) For any non-Gaussian Wigner ensemble, we show that PCA is always suboptimal for detection. However, a variant of PCA achieves the optimal threshold (for benign priors) by pre-transforming the matrix entries according to a carefully designed function. This approach has been stated before, and we give a rigorous and general analysis. III) For both the Gaussian Wishart ensemble and various synchronization problems over groups, we show that inefficient procedures can work below the threshold where PCA succeeds, whereas no known efficient algorithm achieves this. This conjectural gap between what is statistically possible and what can be done efficiently remains open.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/02/2018

Optimality and Sub-optimality of PCA I: Spiked Random Matrix Models

A central problem of random matrix theory is to understand the eigenvalu...
research
01/12/2023

Detection problems in the spiked matrix models

We study the statistical decision process of detecting the low-rank sign...
research
06/25/2018

Fundamental limits of detection in the spiked Wigner model

We study the fundamental limits of detecting the presence of an additive...
research
04/28/2021

Detection of Signal in the Spiked Rectangular Models

We consider the problem of detecting signals in the rank-one signal-plus...
research
06/24/2020

An ℓ_p theory of PCA and spectral clustering

Principal Component Analysis (PCA) is a powerful tool in statistics and ...
research
06/03/2021

Efficient ℤ_2 synchronization on ℤ^d under symmetry-preserving side information

We consider ℤ_2-synchronization on the Euclidean lattice. Every vertex o...
research
08/02/2021

A Random Matrix Perspective on Random Tensors

Tensor models play an increasingly prominent role in many fields, notabl...

Please sign up or login with your details

Forgot password? Click here to reset