Penalized linear regression with high-dimensional pairwise screening

02/08/2019
by   Siliang Gong, et al.

In variable selection, most existing screening methods focus on marginal effects and ignore dependence between covariates. To improve selection performance, we incorporate pairwise effects among covariates into both screening and penalization. We achieve this by studying the asymptotic distribution of the maximal absolute pairwise sample correlation among independent covariates. The novelty of the theory is that the convergence is with respect to the dimensionality p and is uniform with respect to the sample size n. Moreover, we obtain an upper bound for the maximal pairwise R squared when regressing the response onto two different covariates. Based on these extreme value results, we propose a screening procedure to detect covariate pairs that are potentially correlated and associated with the response. We further combine the pairwise screening with Sure Independence Screening and develop a new regularized variable selection procedure. Numerical studies show that our method is competitive in terms of both prediction accuracy and variable selection accuracy.
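The quantities named in the abstract can be illustrated with a small simulation. The sketch below is not the authors' procedure: the function names, the simulation sizes, and the toy response are all assumptions made for illustration. It computes the maximal absolute pairwise sample correlation among independent Gaussian covariates, the R squared from regressing a response on a pair of covariates (using the standard closed form in terms of the three pairwise correlations), and a plain marginal-correlation ranking in the spirit of Sure Independence Screening.

```python
import numpy as np

def max_abs_pairwise_corr(X):
    """Largest |sample correlation| over all distinct column pairs of X."""
    R = np.corrcoef(X, rowvar=False)
    upper = np.triu_indices(R.shape[0], k=1)  # strictly above the diagonal
    return np.abs(R[upper]).max()

def pairwise_r2(X, y, j, k):
    """R squared from regressing y on columns j and k (with intercept).

    Uses the closed form in terms of the three pairwise correlations.
    """
    r_jy = np.corrcoef(X[:, j], y)[0, 1]
    r_ky = np.corrcoef(X[:, k], y)[0, 1]
    r_jk = np.corrcoef(X[:, j], X[:, k])[0, 1]
    return (r_jy**2 + r_ky**2 - 2 * r_jy * r_ky * r_jk) / (1 - r_jk**2)

def marginal_top_k(X, y, k):
    """Indices of the k columns with the largest |corr(X_j, y)|."""
    Xs = (X - X.mean(axis=0)) / X.std(axis=0)
    ys = (y - y.mean()) / y.std()
    marginal = np.abs(Xs.T @ ys) / len(y)
    return np.argsort(marginal)[::-1][:k]

# Hypothetical setup: independent covariates, one true signal in column 0.
rng = np.random.default_rng(0)
n, p = 50, 200
X = rng.standard_normal((n, p))
y = 3.0 * X[:, 0] + rng.standard_normal(n)

null_max = max_abs_pairwise_corr(X)  # spurious correlation level under independence
top = marginal_top_k(X, y, 10)       # marginal screening set of size 10
r2_pair = pairwise_r2(X, y, 0, 1)    # pairwise R squared for columns 0 and 1
```

Even though every column of `X` is independent of the others, `null_max` is typically far from zero when p is large relative to n, which is why a threshold calibrated to this extreme value is needed before a pair is declared correlated.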

research
06/09/2023

Variable screening using factor analysis for high-dimensional data with multicollinearity

Screening methods are useful tools for variable selection in regression ...
research
12/30/2017

An ISIS screening approach involving threshold/partition for variable selection in linear regression

In linear regression, one can select a predictor if the absolute sample ...
research
04/20/2021

Screening methods for linear errors-in-variables models in high dimensions

Microarray studies, in order to identify genes associated with an outcom...
research
04/29/2012

Optimality of Graphlet Screening in High Dimensional Variable Selection

Consider a linear regression model where the design matrix X has n rows ...
research
01/05/2022

High-dimensional variable selection with heterogeneous signals: A precise asymptotic perspective

We study the problem of exact support recovery for high-dimensional spar...
research
03/20/2023

An ADMM approach for multi-response regression with overlapping groups and interaction effects

In this paper, we consider the regularized multi-response regression pro...
research
01/06/2019

Iterated Feature Screening based on Distance Correlation for Ultrahigh-Dimensional Censored Data with Covariates Measurement Error

Feature screening is an important method to reduce the dimension and cap...
