High-dimensional inference robust to outliers with l1-norm penalization

by Jad Beyhum, et al.

This paper studies inference in the high-dimensional linear regression model with outliers. Sparsity constraints are imposed on the vector of coefficients of the covariates. The number of outliers can grow with the sample size while their proportion goes to 0. We propose a two-step procedure for inference on the coefficients of a fixed subset of regressors. The first step is based on several square-root lasso l1-norm penalized estimators, while the second step is the ordinary least squares estimator applied to a well-chosen regression. We establish asymptotic normality of the two-step estimator. The proposed procedure is efficient in the sense that it attains the semiparametric efficiency bound when applied to the model without outliers under homoscedasticity. This approach is also computationally advantageous, since it amounts to solving a finite number of convex optimization programs.
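
To make the setting concrete, the model and a generic first-step estimator could be written as in the sketch below. All notation here (y_i, x_i, \beta, \alpha_i, \varepsilon_i, s_n, and the penalty levels \lambda_\beta, \lambda_\alpha) is assumed for illustration and does not come from the abstract. The mean-shift representation of outliers is one common way to formalize such a model, and the square-root lasso program shown is a generic form; the paper's exact estimators, normalizations, and the second-step regression may differ.

```latex
% Assumed mean-shift formulation: alpha_i is nonzero only for outlying observations.
\[
  y_i = x_i^\top \beta + \alpha_i + \varepsilon_i, \qquad i = 1, \dots, n,
\]
% beta is sparse; the number of outliers s_n may grow while their proportion vanishes.
\[
  s_n = \#\{\, i : \alpha_i \neq 0 \,\}, \qquad \frac{s_n}{n} \to 0 .
\]
% Generic square-root lasso with l1 penalties on both beta and alpha
% (illustrative; lambda_beta and lambda_alpha are hypothetical tuning parameters).
\[
  (\hat\beta, \hat\alpha) \in \arg\min_{b \in \mathbb{R}^p,\; a \in \mathbb{R}^n}
  \frac{1}{\sqrt{n}} \, \lVert y - X b - a \rVert_2
  + \lambda_\beta \lVert b \rVert_1
  + \lambda_\alpha \lVert a \rVert_1 .
\]
```

Under this sketch, each such program is convex in (b, a), which is consistent with the abstract's remark that the procedure reduces to a finite number of convex optimization programs.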




Inference robust to outliers with l1-norm penalization

This paper considers the problem of inference in a linear regression mod...

Robust censored regression with l1-norm regularization

This paper considers inference in a linear regression model with random ...

Robust Nonparametric Regression via Sparsity Control with Application to Load Curve Data Cleansing

Nonparametric methods are widely applicable to statistical inference pro...

Robust-to-outliers square-root LASSO, simultaneous inference with a MOM approach

We consider the least-squares regression problem with unknown noise vari...

Outlier-robust estimation of a sparse linear model using ℓ_1-penalized Huber's M-estimator

We study the problem of estimating a p-dimensional s-sparse vector in a ...

Inference In General Single-Index Models Under High-dimensional Symmetric Designs

We consider the problem of statistical inference for a finite number of ...

Matrix optimization based Euclidean embedding with outliers

Euclidean embedding from noisy observations containing outlier errors is...