High-dimensional robust regression and outliers detection with SLOPE

12/07/2017
by   Alain Virouleau, et al.
0

The problems of outliers detection and robust regression in a high-dimensional setting are fundamental in statistics, and have numerous applications. Following a recent set of works providing methods for simultaneous robust regression and outliers detection, we consider in this paper a model of linear regression with individual intercepts, in a high-dimensional setting. We introduce a new procedure for simultaneous estimation of the linear regression coefficients and intercepts, using two dedicated sorted-ℓ_1 penalizations, also called SLOPE. We develop a complete theory for this problem: first, we provide sharp upper bounds on the statistical estimation error of both the vector of individual intercepts and regression coefficients. Second, we give an asymptotic control on the False Discovery Rate (FDR) and statistical power for support selection of the individual intercepts. As a consequence, this paper is the first to introduce a procedure with guaranteed FDR and statistical power control for outliers detection under the mean-shift model. Numerical illustrations, with a comparison to recent alternative approaches, are provided on both simulated and several real-world datasets. Experiments are conducted using an open-source software written in Python and C++.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/28/2015

Robust Gaussian Graphical Modeling with the Trimmed Graphical Lasso

Gaussian Graphical Models (GGMs) are popular tools for studying network ...
research
09/28/2020

Some exact results for the statistical physics problem of high-dimensional linear regression

High-dimensional linear regression have become recently a subject of man...
research
03/21/2022

Delicatessen: M-Estimation in Python

M-estimation is a general statistical approach that simplifies and unifi...
research
08/12/2022

Sparse change detection in high-dimensional linear regression

We introduce a new methodology 'charcoal' for estimating the location of...
research
12/18/2018

Robust functional ANOVA model with t-process

Robust estimation approaches are of fundamental importance for statistic...
research
11/14/2019

Recent Advances in Algorithmic High-Dimensional Robust Statistics

Learning in the presence of outliers is a fundamental problem in statist...
research
07/08/2020

Sparse Regression for Extreme Values

We study the problem of selecting features associated with extreme value...

Please sign up or login with your details

Forgot password? Click here to reset