Post-Lasso Inference for High-Dimensional Regression

06/16/2018
by   X. Jessie Jeng, et al.
0

Among the most popular variable selection procedures in high-dimensional regression, Lasso provides a solution path to rank the variables and determines a cut-off position on the path to select variables and estimate coefficients. In this paper, we consider variable selection from a new perspective motivated by the frequently occurred phenomenon that relevant variables are not completely distinguishable from noise variables on the solution path. We propose to characterize the positions of the first noise variable and the last relevant variable on the path. We then develop a new variable selection procedure to control over-selection of the noise variables ranking after the last relevant variable, and, at the same time, retain a high proportion of relevant variables ranking before the first noise variable. Our procedure utilizes the recently developed covariance test statistic and Q statistic in post-selection inference. In numerical examples, our method compares favorably with other existing methods in selection accuracy and the ability to interpret its results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/09/2018

Efficient Predictor Ranking and False Discovery Proportion Control in High-Dimensional Regression

We propose a ranking and selection procedure to prioritize relevant pred...
research
02/28/2019

Granger Causality Testing in High-Dimensional VARs: a Post-Double-Selection Procedure

In this paper we develop an LM test for Granger causality in high-dimens...
research
07/30/2020

A Power Analysis for Knockoffs with the Lasso Coefficient-Difference Statistic

In a linear model with possibly many predictors, we consider variable se...
research
04/20/2018

Variable Selection via Adaptive False Negative Control in High-Dimensional Regression

In high-dimensional regression, variable selection methods have been dev...
research
09/22/2020

The Linear Lasso: a location model resolution

We use location model methodology to guide the least squares analysis of...
research
05/25/2022

Resampling-Based Multisplit Inference for High-Dimensional Regression

We propose a novel resampling-based method to construct an asymptoticall...
research
08/10/2017

When Does the First Spurious Variable Get Selected by Sequential Regression Procedures?

Applied statisticians use sequential regression procedures to produce a ...

Please sign up or login with your details

Forgot password? Click here to reset