Sparse PCA: Algorithms, Adversarial Perturbations and Certificates

11/12/2020
by   Tommaso d'Orsi, et al.

We study efficient algorithms for Sparse PCA in standard statistical models (spiked covariance in its Wishart form). Our goal is to achieve optimal recovery guarantees while being resilient to small perturbations. Despite a long history of prior work, including explicit studies of perturbation resilience, the best known algorithmic guarantees for Sparse PCA are fragile and break down under small adversarial perturbations. We observe a basic connection between perturbation resilience and certifying algorithms, that is, algorithms based on certificates of upper bounds on sparse eigenvalues of random matrices. In contrast to other techniques, such certifying algorithms, including the brute-force maximum likelihood estimator, are automatically robust against small adversarial perturbations. We use this connection to obtain the first polynomial-time algorithms for this problem that are resilient against additive adversarial perturbations, by obtaining new efficient certificates for upper bounds on sparse eigenvalues of random matrices. Our algorithms are based either on basic semidefinite programming or on its low-degree sum-of-squares strengthening, depending on the parameter regime. Their guarantees either match or approach the best known guarantees of fragile algorithms in terms of the sparsity of the unknown vector, the number of samples, and the ambient dimension. To complement our algorithmic results, we prove rigorous lower bounds matching the gap between fragile and robust polynomial-time algorithms in a natural computational model based on low-degree polynomials (closely related to the pseudo-calibration technique for sum-of-squares lower bounds) that is known to capture the best known guarantees for related statistical estimation problems. The combination of these results provides formal evidence of an inherent price to pay to achieve robustness.
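
To make the setting concrete, the following is a minimal Python sketch, not the authors' algorithm. It assumes the usual reading of the Wishart spiked covariance model, in which each sample is drawn from N(0, I + beta * v v^T) for a k-sparse unit vector v, and it implements only the exponential-time brute-force search over supports that the abstract mentions as a certifying estimator. The function names, the parameter values, and the choice of +/-1/sqrt(k) entries for v are illustrative assumptions.

```python
import itertools
import numpy as np

def sample_spiked_wishart(n, d, k, beta, rng):
    """Draw n samples from N(0, I_d + beta * v v^T), v a k-sparse unit vector.

    Assumption: v has entries +/- 1/sqrt(k) on a uniformly random support.
    """
    support = rng.choice(d, size=k, replace=False)
    v = np.zeros(d)
    v[support] = rng.choice([-1.0, 1.0], size=k) / np.sqrt(k)
    u = rng.standard_normal(n)         # per-sample spike coefficients
    W = rng.standard_normal((n, d))    # isotropic Gaussian noise
    Y = np.sqrt(beta) * np.outer(u, v) + W
    return Y, v

def brute_force_sparse_pca(Y, k):
    """Exhaustive search over all supports of size k.

    Returns the largest k-sparse eigenvalue of the empirical covariance
    and a maximizing k-sparse unit vector. Exponential in k; for
    illustration on tiny instances only.
    """
    n, d = Y.shape
    cov = Y.T @ Y / n
    best_val, best_vec = -np.inf, None
    for S in itertools.combinations(range(d), k):
        idx = np.array(S)
        vals, vecs = np.linalg.eigh(cov[np.ix_(idx, idx)])
        if vals[-1] > best_val:
            best_val = vals[-1]
            best_vec = np.zeros(d)
            best_vec[idx] = vecs[:, -1]
    return best_val, best_vec

rng = np.random.default_rng(0)
Y, v = sample_spiked_wishart(n=200, d=12, k=3, beta=4.0, rng=rng)
sparse_val, v_hat = brute_force_sparse_pca(Y, k=3)

# The top eigenvalue of the full covariance is a trivial (and usually loose)
# certificate: it upper-bounds every k-sparse eigenvalue.
top_val = np.linalg.eigvalsh(Y.T @ Y / Y.shape[0])[-1]
```

The last line shows the simplest possible certificate of an upper bound on the sparse eigenvalue. The semidefinite programming and sum-of-squares certificates described in the abstract are meant to give tighter bounds in polynomial time; they are not reproduced here.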


research
11/14/2022

Higher degree sum-of-squares relaxations robust against oblivious outliers

We consider estimation models of the form Y=X^*+N, where X^* is some m-d...
research
10/13/2017

The power of sum-of-squares for detecting hidden structures

We study planted problems---finding hidden structures in random noisy in...
research
02/20/2023

Sparse PCA Beyond Covariance Thresholding

In the Wishart model for sparse PCA we are given n samples Y_1,…, Y_n dr...
research
11/04/2019

Lifting Sum-of-Squares Lower Bounds: Degree-2 to Degree-4

The degree-4 Sum-of-Squares (SoS) SDP relaxation is a powerful algorithm...
research
03/26/2023

Convergence rates for sums-of-squares hierarchies with correlative sparsity

This work derives upper bounds on the convergence rate of the moment-sum...
research
05/25/2015

Statistical and Algorithmic Perspectives on Randomized Sketching for Ordinary Least-Squares -- ICML

We consider statistical and algorithmic aspects of solving large-scale l...
research
10/15/2019

A greedy anytime algorithm for sparse PCA

The taxing computational effort that is involved in solving some high-di...
