On the sign recovery given by the thresholded LASSO and thresholded Basis Pursuit

12/13/2018
by Patrick J. C. Tardivel, et al.

We consider the regression model in which the number of observations is smaller than the number of explanatory variables. It is well known that the popular Least Absolute Shrinkage and Selection Operator (LASSO) can recover the sign of the regression coefficients only if a very stringent irrepresentable condition is satisfied. In this article, we first provide a new result about the irrepresentable condition: the probability of recovering the sign with LASSO is smaller than 1/2 when the irrepresentable condition does not hold. Next, we revisit properties of the thresholded LASSO and provide new theoretical results in the asymptotic setup in which the design matrix is fixed and the magnitudes of the nonzero regression coefficients tend to infinity. Apart from LASSO, our results also cover basis pursuit, which can be thought of as a limiting case of LASSO when the tuning parameter tends to 0. Compared to the classical asymptotics, our approach reduces the technical burden. We formulate an easy identifiability condition which turns out to be sufficient and necessary for the thresholded LASSO to recover the sign of a sufficiently large signal. Our simulation study illustrates the difference between the irrepresentable and the identifiability conditions. Interestingly, while the irrepresentable condition becomes more difficult to satisfy for strongly correlated designs, this does not seem to be true for the identifiability condition. In fact, when the correlations are positive and the nonzero coefficients are of the same sign, the identifiability condition allows the number of nonzero coefficients to be larger than in the case where the regressors are independent.
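To make the two ingredients of the abstract concrete, here is a minimal sketch, not the authors' code. It assumes the form of the irrepresentable condition commonly used in the LASSO literature, namely that the sup-norm of X_Sc^T X_S (X_S^T X_S)^{-1} sign(beta_S) is at most 1, where S is the support of beta; the abstract itself does not state the formula. The function names `irrepresentable_value` and `threshold` are hypothetical, and the sketch assumes that X_S has full column rank and that at least one coefficient is zero.

```python
import numpy as np

def irrepresentable_value(X, beta):
    """Sup-norm of X_Sc^T X_S (X_S^T X_S)^{-1} sign(beta_S).

    Assumed textbook form of the irrepresentable quantity; the condition
    is usually stated as this value being at most 1.
    """
    support = beta != 0
    X_S = X[:, support]        # columns of the nonzero coefficients
    X_Sc = X[:, ~support]      # columns of the zero coefficients
    sign_S = np.sign(beta[support])
    # Solve (X_S^T X_S) w = sign_S rather than forming the inverse.
    w = np.linalg.solve(X_S.T @ X_S, sign_S)
    return float(np.max(np.abs(X_Sc.T @ (X_S @ w))))

def threshold(beta_hat, tau):
    """Hard thresholding: zero out entries with |beta_hat_i| <= tau."""
    return np.where(np.abs(beta_hat) > tau, beta_hat, 0.0)
```

Given any estimate `beta_hat` (from LASSO or basis pursuit, computed elsewhere), sign recovery by the thresholded estimator amounts to checking whether `np.sign(threshold(beta_hat, tau))` equals `np.sign(beta)` for some threshold `tau`.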


research
03/22/2022

Pattern recovery by SLOPE

LASSO and SLOPE are two popular methods for dimensionality reduction in ...
research
05/02/2013

Model Selection for High-Dimensional Regression under the Generalized Irrepresentability Condition

In the high-dimensional regression model a response variable is linearly...
research
03/17/2015

Improved LASSO

We propose an improved LASSO estimation technique based on Stein-rule. W...
research
08/01/2022

The Effect of Omitted Variables on the Sign of Regression Coefficients

Omitted variables are a common concern in empirical research. We show th...
research
04/20/2020

The Geometry of Uniqueness and Model Selection of Penalized Estimators including SLOPE, LASSO, and Basis Pursuit

We provide a necessary and sufficient condition for the uniqueness of pe...
research
03/29/2023

Optimal Supersaturated Designs for Lasso Sign Recovery

Supersaturated designs, in which the number of factors exceeds the numbe...
research
02/25/2023

Average case analysis of Lasso under ultra-sparse conditions

We analyze the performance of the least absolute shrinkage and selection...
