Robust Sub-Gaussian Principal Component Analysis and Width-Independent Schatten Packing

06/12/2020
by   Arun Jambulapati, et al.
0

We develop two methods for the following fundamental statistical task: given an ϵ-corrupted set of n samples from a d-dimensional sub-Gaussian distribution, return an approximate top eigenvector of the covariance matrix. Our first robust PCA algorithm runs in polynomial time, returns a 1 - O(ϵlogϵ^-1)-approximate top eigenvector, and is based on a simple iterative filtering approach. Our second, which attains a slightly worse approximation factor, runs in nearly-linear time and sample complexity under a mild spectral gap assumption. These are the first polynomial-time algorithms yielding non-trivial information about the covariance of a corrupted sub-Gaussian distribution without requiring additional algebraic structure of moments. As a key technical tool, we develop the first width-independent solvers for Schatten-p norm packing semidefinite programs, giving a (1 + ϵ)-approximate solution in O(plog(ndϵ)ϵ^-1) input-sparsity time iterations (where n, d are problem dimensions).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/23/2018

High-Dimensional Robust Mean Estimation in Nearly-Linear Time

We study the fundamental problem of high-dimensional mean estimation in ...
research
06/11/2019

Faster Algorithms for High-Dimensional Robust Covariance Estimation

We study the problem of estimating the covariance matrix of a high-dimen...
research
06/23/2020

Approximation Algorithms for Sparse Principal Component Analysis

We present three provably accurate, polynomial time, approximation algor...
research
02/12/2020

Positive Semidefinite Programming: Mixed, Parallel, and Width-Independent

We give the first approximation algorithm for mixed packing and covering...
research
12/09/2021

Estimating the Longest Increasing Subsequence in Nearly Optimal Time

Longest Increasing Subsequence (LIS) is a fundamental statistic of a seq...
research
04/04/2017

Polynomial Time and Sample Complexity for Non-Gaussian Component Analysis: Spectral Methods

The problem of Non-Gaussian Component Analysis (NGCA) is about finding a...
research
11/29/2019

Adversarially Robust Low Dimensional Representations

Adversarial or test time robustness measures the susceptibility of a mac...

Please sign up or login with your details

Forgot password? Click here to reset