Model Selection for High-Dimensional Regression under the Generalized Irrepresentability Condition

05/02/2013
by   Adel Javanmard, et al.
0

In the high-dimensional regression model a response variable is linearly related to p covariates, but the sample size n is smaller than p. We assume that only a small subset of covariates is `active' (i.e., the corresponding coefficients are non-zero), and consider the model-selection problem of identifying the active covariates. A popular approach is to estimate the regression coefficients through the Lasso (ℓ_1-regularized least squares). This is known to correctly identify the active set only if the irrelevant covariates are roughly orthogonal to the relevant ones, as quantified through the so called `irrepresentability' condition. In this paper we study the `Gauss-Lasso' selector, a simple two-stage method that first solves the Lasso, and then performs ordinary least squares restricted to the Lasso active set. We formulate `generalized irrepresentability condition' (GIC), an assumption that is substantially weaker than irrepresentability. We prove that, under GIC, the Gauss-Lasso correctly recovers the active set.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/20/2019

Omitted variable bias of Lasso-based inference methods under limited variability: A finite sample analysis

We study the finite sample behavior of Lasso and Lasso-based inference m...
research
02/11/2008

On the ℓ_1-ℓ_q Regularized Regression

In this paper we consider the problem of grouped variable selection in h...
research
12/10/2020

Optimal selection of a common subset of covariates for different regressions

Given a regression dataset of size n, most of the classical model select...
research
03/23/2019

Bayesian Factor-adjusted Sparse Regression

This paper investigates the high-dimensional linear regression with high...
research
12/13/2018

On the sign recovery given by the thresholded LASSO and thresholded Basis Pursuit

We consider the regression model, when the number of observations is sma...
research
08/05/2008

Support union recovery in high-dimensional multivariate regression

In multivariate regression, a K-dimensional response vector is regressed...
research
06/19/2018

Simultaneous Signal Subspace Rank and Model Selection with an Application to Single-snapshot Source Localization

This paper proposes a novel method for model selection in linear regress...

Please sign up or login with your details

Forgot password? Click here to reset