Being Robust (in High Dimensions) Can Be Practical

03/02/2017
by Ilias Diakonikolas, et al.

Robust estimation is much more challenging in high dimensions than it is in one dimension: most techniques either lead to intractable optimization problems or yield estimators that can tolerate only a tiny fraction of errors. Recent work in theoretical computer science has shown that, in appropriate distributional models, it is possible to robustly estimate the mean and covariance with polynomial-time algorithms that can tolerate a constant fraction of corruptions, independent of the dimension. However, the sample and time complexity of these algorithms are prohibitively large for high-dimensional applications. In this work, we address both of these issues by establishing sample complexity bounds that are optimal, up to logarithmic factors, as well as giving various refinements that allow the algorithms to tolerate a much larger fraction of corruptions. Finally, we show on both synthetic and real data that our algorithms have state-of-the-art performance and suddenly make high-dimensional robust estimation a realistic possibility.
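
To make the setting concrete, here is a minimal Python sketch of robust mean estimation under contamination. It is not the authors' algorithm: it is a simplified spectral filter that assumes the clean data are Gaussian with identity covariance, and the function names (`filter_mean`, `demo`), the eigenvalue threshold of 1.5, the 1% drop rate per round, and the outlier placement are all illustrative assumptions. The idea it shows is that outliers able to shift the mean must also inflate the variance along some direction, so that direction can be found and the most extreme points along it discarded.

```python
import numpy as np


def filter_mean(x, threshold=1.5, drop_frac=0.01, max_rounds=100):
    """Simplified spectral filter (illustrative, assumes identity covariance).

    Repeatedly finds the direction of largest empirical variance. While that
    variance exceeds `threshold`, drops the `drop_frac` fraction of points
    that deviate most from the current mean along that direction.
    """
    x = np.asarray(x, dtype=float)
    for _ in range(max_rounds):
        mu = x.mean(axis=0)
        cov = np.cov(x, rowvar=False)
        # eigh returns eigenvalues in ascending order.
        eigvals, eigvecs = np.linalg.eigh(cov)
        if eigvals[-1] <= threshold:
            break  # no direction looks suspicious any more
        v = eigvecs[:, -1]
        scores = ((x - mu) @ v) ** 2
        k = max(1, int(drop_frac * len(x)))
        keep = np.argsort(scores)[:-k]  # discard the k largest scores
        x = x[keep]
    return x.mean(axis=0)


def demo(n=5000, d=50, eps=0.1, seed=0):
    """Compare estimators on N(0, I) data with an eps fraction replaced."""
    rng = np.random.default_rng(seed)
    x = rng.standard_normal((n, d))
    # Hypothetical corruption: every coordinate of each outlier is set to 3.
    x[: int(eps * n)] = 3.0

    def err(estimate):
        # The true mean is the zero vector, so the error is the norm.
        return float(np.linalg.norm(estimate))

    return {
        "sample_mean": err(x.mean(axis=0)),
        "coordinate_median": err(np.median(x, axis=0)),
        "filter": err(filter_mean(x)),
    }


if __name__ == "__main__":
    for name, value in demo().items():
        print(f"{name}: {value:.3f}")
```

In this toy setup the error of the sample mean and of the coordinate-wise median both grow with the dimension `d`, while the filtered estimate stays close to what clean data alone would give. The fixed drop rate and threshold are conveniences for the sketch and carry no guarantees.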


research
12/09/2022

Robustness Implies Privacy in Statistical Estimation

We study the relationship between adversarial robustness and differentia...
research
04/21/2016

Robust Estimators in High Dimensions without the Computational Intractability

We study high-dimensional distribution learning in an agnostic setting w...
research
11/14/2019

Recent Advances in Algorithmic High-Dimensional Robust Statistics

Learning in the presence of outliers is a fundamental problem in statist...
research
04/12/2017

Robustly Learning a Gaussian: Getting Optimal Error, Efficiently

We study the fundamental problem of learning the parameters of a high-di...
research
08/15/2023

Effective Continued Fraction Dimension versus Effective Hausdorff Dimension of Reals

We establish that constructive continued fraction dimension originally d...
research
08/21/2020

Robust Mean Estimation in High Dimensions via ℓ_0 Minimization

We study the robust mean estimation problem in high dimensions, where α ...
research
06/09/2013

Minimax Theory for High-dimensional Gaussian Mixtures with Sparse Mean Separation

While several papers have investigated computationally and statistically...
