List Decodable Subspace Recovery

02/07/2020
by Prasad Raghavendra, et al.

Learning from data in the presence of outliers is a fundamental problem in statistics. In this work, we study robust statistics in the presence of overwhelming outliers for the fundamental problem of subspace recovery. Given a dataset where an α fraction (less than half) of the data is distributed uniformly in an unknown k-dimensional subspace in d dimensions, and with no additional assumptions on the remaining data, the goal is to recover a succinct list of O(1/α) subspaces, one of which is nontrivially correlated with the planted subspace. We provide the first polynomial-time algorithm for the "list decodable subspace recovery" problem, and subsume it under a more general framework of list decoding over distributions that are "certifiably resilient", capturing state-of-the-art results for list decodable mean estimation and regression.
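
To make the problem setup concrete, the following is a minimal sketch of a planted instance and of one possible way to score a candidate subspace against the planted one. It is not the paper's algorithm. Several choices here are assumptions for illustration only: inliers are drawn as standard Gaussians inside the subspace (a stand-in for "distributed uniformly"), outliers are drawn as isotropic Gaussians (the problem allows them to be arbitrary), and correlation is measured as the normalized squared Frobenius norm of the product of the two orthonormal bases. All function names are hypothetical.

```python
import numpy as np


def random_subspace_basis(d, k, rng):
    """Return a d x k matrix with orthonormal columns spanning a random subspace."""
    q, _ = np.linalg.qr(rng.standard_normal((d, k)))
    return q


def planted_instance(n, d, k, alpha, rng):
    """Generate n points in d dimensions; an alpha fraction lies in a planted
    k-dimensional subspace.

    Assumption: inliers are Gaussian within the subspace, and outliers are
    isotropic Gaussian. In the actual problem the outliers are unconstrained.
    """
    basis = random_subspace_basis(d, k, rng)
    n_in = int(round(alpha * n))
    inliers = rng.standard_normal((n_in, k)) @ basis.T
    outliers = rng.standard_normal((n - n_in, d))
    data = np.vstack([inliers, outliers])
    is_inlier = np.zeros(n, dtype=bool)
    is_inlier[:n_in] = True
    perm = rng.permutation(n)  # hide which points are inliers
    return data[perm], is_inlier[perm], basis


def subspace_correlation(basis_a, basis_b):
    """Assumed correlation score in [0, 1] for two k-dimensional subspaces:
    ||A^T B||_F^2 / k, which equals 1 when the subspaces coincide."""
    k = basis_a.shape[1]
    return np.linalg.norm(basis_a.T @ basis_b, "fro") ** 2 / k


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    data, is_inlier, planted = planted_instance(n=200, d=10, k=3, alpha=0.25, rng=rng)
    guess = random_subspace_basis(10, 3, rng)
    print("correlation of a random guess:", subspace_correlation(planted, guess))
```

In this setting a list-decoding algorithm would return roughly 1/α candidate bases, and success would mean that at least one of them scores well above what a random guess achieves.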

research
05/20/2020

List Decodable Mean Estimation in Nearly Linear Time

Learning from data in the presence of outliers is a fundamental problem ...
research
03/02/2018

An Overview of Robust Subspace Recovery

This paper will serve as an introduction to the body of work on robust s...
research
04/05/2019

Robust Subspace Recovery with Adversarial Outliers

We study the problem of robust subspace recovery (RSR) in the presence o...
research
02/12/2020

List-Decodable Subspace Recovery via Sum-of-Squares

We give the first efficient algorithm for the problem of list-decodable ...
research
01/22/2022

Implicit Bias of Projected Subgradient Method Gives Provable Robust Recovery of Subspaces of Unknown Codimension

Robust subspace recovery (RSR) is a fundamental problem in robust repres...
research
03/14/2017

Subspace Learning in The Presence of Sparse Structured Outliers and Noise

Subspace learning is an important problem, which has many applications i...
research
12/18/2010

lp-Recovery of the Most Significant Subspace among Multiple Subspaces with Outliers

We assume data sampled from a mixture of d-dimensional linear subspaces ...
