Simulation-Selection-Extrapolation: Estimation in High-Dimensional Errors-in-Variables Models

08/30/2018
by Linh Nghiem, et al.

This paper considers errors-in-variables models in a high-dimensional setting where the number of covariates can be much larger than the sample size and only a small number of covariates have non-zero coefficients. Measurement error in the covariates can severely bias parameter estimates and also impairs the ability of penalized methods such as the lasso to recover the true sparsity pattern. A new estimation procedure called SIMSELEX (SIMulation-SELection-EXtrapolation) is proposed, which augments the traditional SIMEX approach with a variable selection step based on the group lasso. The SIMSELEX estimator is shown to perform well in variable selection and to have significantly lower estimation error than naive estimators that ignore measurement error. SIMSELEX can be applied in a variety of errors-in-variables settings, including linear models, generalized linear models, and Cox survival models. It is furthermore shown how SIMSELEX can be applied to spline-based regression models. SIMSELEX estimators are compared to the corrected lasso and the conic programming estimator for a linear model, and to the conditional scores lasso for a logistic regression model. Finally, the method is used to analyze a microarray dataset that contains gene expression measurements of favorable histology Wilms tumors.
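To make the simulation and extrapolation idea concrete, the following is a minimal sketch of classic SIMEX for a single-covariate linear model, not the SIMSELEX procedure itself: the group lasso selection step is omitted entirely. It assumes the measurement error standard deviation is known, uses a quadratic extrapolant, and the function name `simex_slope`, the grid of noise levels, and the simulated data are all hypothetical choices made for illustration.

```python
import numpy as np


def simex_slope(w, y, sigma_u, lambdas=(0.0, 0.5, 1.0, 1.5, 2.0), n_sim=50, seed=0):
    """Classic SIMEX for the slope of y on an error-prone covariate w.

    Assumes w = x + u with u ~ N(0, sigma_u^2) and sigma_u known.
    Returns (simex_estimate, naive_estimate).
    """
    rng = np.random.default_rng(seed)
    lambdas = np.asarray(lambdas, dtype=float)
    mean_slopes = []
    for lam in lambdas:
        slopes = []
        for _ in range(n_sim):
            # Simulation step: add extra noise so the total error
            # variance becomes (1 + lam) * sigma_u^2.
            w_sim = w + np.sqrt(lam) * sigma_u * rng.standard_normal(w.shape)
            # Naive least-squares slope that ignores measurement error.
            slopes.append(np.polyfit(w_sim, y, 1)[0])
        mean_slopes.append(np.mean(slopes))
    # Extrapolation step: fit a quadratic in lam and evaluate it at
    # lam = -1, which corresponds to zero measurement error.
    coefs = np.polyfit(lambdas, mean_slopes, 2)
    return float(np.polyval(coefs, -1.0)), float(mean_slopes[0])


# Hypothetical simulated data: true slope 2, error variance 0.5.
rng = np.random.default_rng(1)
n, beta, sigma_u = 2000, 2.0, np.sqrt(0.5)
x = rng.standard_normal(n)                    # unobserved true covariate
w = x + sigma_u * rng.standard_normal(n)      # observed, contaminated covariate
y = beta * x + 0.5 * rng.standard_normal(n)   # response

simex_est, naive_est = simex_slope(w, y, sigma_u)
print(f"naive slope: {naive_est:.3f}, SIMEX slope: {simex_est:.3f}, truth: {beta}")
```

In this setup the naive slope is attenuated toward zero, and the extrapolated value moves back toward the true slope. The quadratic extrapolant is only an approximation, so some residual bias is expected.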

