Model selection with lasso-zero: adding straw to the haystack to better find needles

05/14/2018
by   Pascaline Descloux, et al.
0

The high-dimensional linear model y = X β^0 + ϵ is considered and the focus is put on the problem of recovering the support S^0 of the sparse vector β^0. We introduce lasso-zero, a new ℓ_1-based estimator whose novelty resides in an "overfit, then threshold" paradigm and the use of noise dictionaries for overfitting the response. The methodology is supported by theoretical results obtained in the special case where no noise dictionary is used. In this case, lasso-zero boils down to thresholding the basis pursuit solution. We prove that this procedure requires weaker conditions on X and S^0 than the lasso for exact support recovery, and controls the false discovery rate for orthonormal designs when tuned by the quantile universal threshold. However it requires a high signal-to-noise ratio, and the use of noise dictionaries addresses this issue. The threshold selection procedure is based on a pivotal statistic and does not require knowledge of the noise level. Numerical simulations show that lasso-zero performs well in terms of support recovery and provides a good trade-off between high true positive rate and low false discovery rate compared to competitors.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/05/2014

Quantile universal threshold: model selection at the detection edge for high-dimensional linear regression

To estimate a sparse linear model from data with Gaussian noise, consili...
research
03/29/2019

The False Positive Control Lasso

In high dimensional settings where a small number of regressors are expe...
research
01/02/2011

Sparse recovery with unknown variance: a LASSO-type approach

We address the issue of estimating the regression vector β in the generi...
research
02/25/2023

Average case analysis of Lasso under ultra-sparse conditions

We analyze the performance of the least absolute shrinkage and selection...
research
05/12/2020

Robust Lasso-Zero for sparse corruption and model selection with missing covariates

We propose Robust Lasso-Zero, an extension of the Lasso-Zero methodology...
research
11/03/2010

The Lasso under Heteroscedasticity

The performance of the Lasso is well understood under the assumptions of...
research
07/14/2021

On the early solution path of best subset selection

The early solution path, which tracks the first few variables that enter...

Please sign up or login with your details

Forgot password? Click here to reset