Debiased Lasso After Sample Splitting for Estimation and Inference in High Dimensional Generalized Linear Models

02/28/2023
by   Omar Vazquez, et al.
0

We consider random sample splitting for estimation and inference in high dimensional generalized linear models, where we first apply the lasso to select a submodel using one subsample and then apply the debiased lasso to fit the selected model using the remaining subsample. We show that, no matter including a prespecified subset of regression coefficients or not, the debiased lasso estimation of the selected submodel after a single splitting follows a normal distribution asymptotically. Furthermore, for a set of prespecified regression coefficients, we show that a multiple splitting procedure based on the debiased lasso can address the loss of efficiency associated with sample splitting and produce asymptotically normal estimates under mild conditions. Our simulation results indicate that using the debiased lasso instead of the standard maximum likelihood estimator in the estimation stage can vastly reduce the bias and variance of the resulting estimates. We illustrate the proposed multiple splitting debiased lasso method with an analysis of the smoking data of the Mid-South Tobacco Case-Control Study.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/29/2022

A new procedure for Selective Inference with the Generalized Linear Lasso

This articles investigates the distribution of the solutions of the gene...
research
03/14/2015

Communication-efficient sparse regression: a one-shot approach

We devise a one-shot approach to distributed sparse regression in the hi...
research
01/29/2020

Adaptive Estimation and Statistical Inference for High-Dimensional Graph-Based Linear Models

We consider adaptive estimation and statistical inference for high-dimen...
research
03/11/2019

Estimation and Inference for High Dimensional Generalized Linear Models: A Splitting and Smoothing Approach

For a better understanding of the molecular causes of lung cancer, the B...
research
08/04/2019

Method of Contraction-Expansion (MOCE) for Simultaneous Inference in Linear Models

Simultaneous inference after model selection is of critical importance t...
research
10/23/2020

Design of c-Optimal Experiments for High dimensional Linear Models

We study random designs that minimize the asymptotic variance of a de-bi...
research
10/25/2019

Boosting heritability: estimating the genetic component of phenotypic variation with multiple sample splitting

Heritability is a central measure in genetics quantifying how much of th...

Please sign up or login with your details

Forgot password? Click here to reset