Robust Sparse Mean Estimation via Sum of Squares

06/07/2022
by   Ilias Diakonikolas, et al.
6

We study the problem of high-dimensional sparse mean estimation in the presence of an ϵ-fraction of adversarial outliers. Prior work obtained sample and computationally efficient algorithms for this task for identity-covariance subgaussian distributions. In this work, we develop the first efficient algorithms for robust sparse mean estimation without a priori knowledge of the covariance. For distributions on ℝ^d with "certifiably bounded" t-th moments and sufficiently light tails, our algorithm achieves error of O(ϵ^1-1/t) with sample complexity m = (klog(d))^O(t)/ϵ^2-2/t. For the special case of the Gaussian distribution, our algorithm achieves near-optimal error of Õ(ϵ) with sample complexity m = O(k^4 polylog(d))/ϵ^2. Our algorithms follow the Sum-of-Squares based, proofs to algorithms approach. We complement our upper bounds with Statistical Query and low-degree polynomial testing lower bounds, providing evidence that the sample-time-error tradeoffs achieved by our algorithms are qualitatively the best possible.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/10/2022

List-Decodable Sparse Mean Estimation via Difference-of-Pairs Filtering

We study the problem of list-decodable sparse mean estimation. Specifica...
research
10/22/2021

Polynomial-Time Sum-of-Squares Can Robustly Estimate Mean and Covariance of Gaussians Optimally

In this work, we revisit the problem of estimating the mean and covarian...
research
11/29/2022

Outlier-Robust Sparse Mean Estimation for Heavy-Tailed Distributions

We study the fundamental task of outlier-robust mean estimation for heav...
research
07/30/2018

High-dimensional estimation via sum-of-squares proofs

Estimation is the computational task of recovering a hidden parameter x ...
research
07/22/2021

Learning Sparse Fixed-Structure Gaussian Bayesian Networks

Gaussian Bayesian networks (a.k.a. linear Gaussian structural equation m...
research
11/05/2019

Efficiently Learning Structured Distributions from Untrusted Batches

We study the problem, introduced by Qiao and Valiant, of learning from u...
research
03/08/2018

Learning with Rules

Complex classifiers may exhibit "embarassing" failures in cases that wou...

Please sign up or login with your details

Forgot password? Click here to reset