A Power Analysis for Knockoffs with the Lasso Coefficient-Difference Statistic

07/30/2020
by   Asaf Weinstein, et al.
0

In a linear model with possibly many predictors, we consider variable selection procedures given by {1≤ j≤ p: |β_j(λ)| > t}, where β(λ) is the Lasso estimate of the regression coefficients, and where λ and t may be data dependent. Ordinary Lasso selection is captured by using t=0, thus allowing to control only λ, whereas thresholded-Lasso selection allows to control both λ and t. The potential advantages of the latter over the former in terms of power—figuratively, opening up the possibility to look further down the Lasso path—have been quantified recently leveraging advances in approximate message-passing (AMP) theory, but the implications are actionable only when assuming substantial knowledge of the underlying signal. In this work we study theoretically the power of a knockoffs-calibrated counterpart of thresholded-Lasso that enables us to control FDR in the realistic situation where no prior information about the signal is available. Although the basic AMP framework remains the same, our analysis requires a significant technical extension of existing theory in order to handle the pairing between original variables and their knockoffs. Relying on this extension we obtain exact asymptotic predictions for the true positive proportion achievable at a prescribed type I error level. In particular, we show that the knockoffs version of thresholded-Lasso can perform much better than ordinary Lasso selection if λ is chosen by cross-validation on the augmented matrix.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/16/2018

Post-Lasso Inference for High-Dimensional Regression

Among the most popular variable selection procedures in high-dimensional...
research
11/05/2015

False Discoveries Occur Early on the Lasso Path

In regression settings where explanatory variables have very low correla...
research
09/23/2013

Asymptotic Analysis of LASSOs Solution Path with Implications for Approximate Message Passing

This paper concerns the performance of the LASSO (also knows as basis pu...
research
04/09/2018

Efficient Predictor Ranking and False Discovery Proportion Control in High-Dimensional Regression

We propose a ranking and selection procedure to prioritize relevant pred...
research
04/02/2019

An Adapted Geographically Weighted Lasso(Ada-GWL) model for estimating metro ridership

Ridership estimation at station level plays a critical role in metro tra...
research
07/31/2023

LASSO extension: using the number of non-zero coefficients to test the global model hypothesis

In this paper, we propose a test procedure based on the LASSO methodolog...
research
06/04/2018

Post model-fitting exploration via a "Next-Door" analysis

We propose a simple method for evaluating the model that has been chosen...

Please sign up or login with your details

Forgot password? Click here to reset