Robust methods for high-dimensional linear learning

08/10/2022
βˆ™
by   Ibrahim Merad, et al.
βˆ™
0
βˆ™

We propose statistically robust and computationally efficient linear learning methods in the high-dimensional batch setting, where the number of features d may exceed the sample size n. We employ, in a generic learning setting, two algorithms depending on whether the considered loss function is gradient-Lipschitz or not. Then, we instantiate our framework on several applications including vanilla sparse, group-sparse and low-rank matrix recovery. This leads, for each application, to efficient and robust learning algorithms, that reach near-optimal estimation rates under heavy-tailed distributions and the presence of outliers. For vanilla s-sparsity, we are able to reach the slog (d)/n rate under heavy-tails and Ξ·-corruption, at a computational cost comparable to that of non-robust analogs. We provide an efficient implementation of our algorithms in an open-source π™Ώπš’πšπš‘πš˜πš— library called πš•πš’πš—πš•πšŽπšŠπš›πš—, by means of which we carry out numerical experiments which confirm our theoretical findings together with a comparison to other recent approaches proposed in the literature.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
βˆ™ 03/02/2022

Computationally Efficient and Statistically Optimal Robust Low-rank Matrix and Tensor Estimation

Low-rank matrix estimation under heavy-tailed noise is challenging, both...
research
βˆ™ 05/10/2023

Computationally Efficient and Statistically Optimal Robust High-Dimensional Linear Regression

High-dimensional linear regression under heavy-tailed noise or outlier c...
research
βˆ™ 11/19/2019

Outlier-Robust High-Dimensional Sparse Estimation via Iterative Filtering

We study high-dimensional sparse estimation tasks in a robust setting wh...
research
βˆ™ 11/29/2022

Outlier-Robust Sparse Mean Estimation for Heavy-Tailed Distributions

We study the fundamental task of outlier-robust mean estimation for heav...
research
βˆ™ 07/01/2019

A Unified Approach to Robust Mean Estimation

In this paper, we develop connections between two seemingly disparate, b...
research
βˆ™ 10/16/2018

High-dimensional Varying Index Coefficient Models via Stein's Identity

We study the parameter estimation problem for a single-index varying coe...
research
βˆ™ 03/15/2019

A nonasymptotic law of iterated logarithm for robust online estimators

In this paper, we provide tight deviation bounds for M-estimators, which...

Please sign up or login with your details

Forgot password? Click here to reset