Robust Estimators in High Dimensions without the Computational Intractability

04/21/2016
by   Ilias Diakonikolas, et al.
0

We study high-dimensional distribution learning in an agnostic setting where an adversary is allowed to arbitrarily corrupt an ε-fraction of the samples. Such questions have a rich history spanning statistics, machine learning and theoretical computer science. Even in the most basic settings, the only known approaches are either computationally inefficient or lose dimension-dependent factors in their error guarantees. This raises the following question:Is high-dimensional agnostic distribution learning even possible, algorithmically? In this work, we obtain the first computationally efficient algorithms with dimension-independent error guarantees for agnostically learning several fundamental classes of high-dimensional distributions: (1) a single Gaussian, (2) a product distribution on the hypercube, (3) mixtures of two product distributions (under a natural balancedness condition), and (4) mixtures of spherical Gaussians. Our algorithms achieve error that is independent of the dimension, and in many cases scales nearly-linearly with the fraction of adversarially corrupted samples. Moreover, we develop a general recipe for detecting and correcting corruptions in high-dimensions, that may be applicable to many other problems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/23/2016

Robust Learning of Fixed-Structure Bayesian Networks

We investigate the problem of learning Bayesian networks in an agnostic ...
research
04/12/2017

Robustly Learning a Gaussian: Getting Optimal Error, Efficiently

We study the fundamental problem of learning the parameters of a high-di...
research
03/02/2017

Being Robust (in High Dimensions) Can Be Practical

Robust estimation is much more challenging in high dimensions than it is...
research
02/21/2023

Robust Mean Estimation Without a Mean: Dimension-Independent Error in Polynomial Time for Symmetric Distributions

In this work, we study the problem of robustly estimating the mean/locat...
research
04/26/2022

Streaming Algorithms for High-Dimensional Robust Statistics

We study high-dimensional robust statistics tasks in the streaming model...
research
11/19/2019

Outlier-Robust High-Dimensional Sparse Estimation via Iterative Filtering

We study high-dimensional sparse estimation tasks in a robust setting wh...
research
11/03/2019

Testing noisy linear functions for sparsity

We consider the following basic inference problem: there is an unknown h...

Please sign up or login with your details

Forgot password? Click here to reset