Robust variable screening for regression using factor profiling

11/27/2017
by   Yixin Wang, et al.
0

Sure Independence Screening is a fast procedure for variable selection in ultra-high dimensional regression analysis. Unfortunately, its performance greatly deteriorates with increasing dependence among the predictors. To solve this issue, Factor Profiled Sure Independence Screening (FPSIS) models the correlation structure of the predictor variables, assuming that it can be represented by a few latent factors. The correlations can then be profiled out by projecting the data onto the orthogonal complement of the subspace spanned by these factors. However, neither of these methods can handle the presence of outliers in the data. Therefore, we propose a robust screening method which uses least trimmed squares principal component analysis to estimate the latent factors and the factor profiled variables. Variable screening is then performed on factor profiled variables by using regression MM-estimators. Different types of outliers in this model and their roles in variable screening are studied. Both simulation studies and a real data analysis show that the proposed robust procedure has good performance on clean data and outperforms the two nonrobust methods on contaminated data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/30/2020

A robust variable screening procedure for ultra-high dimensional data

Variable selection in ultra-high dimensional regression problems has bec...
research
06/05/2015

High-dimensional Ordinary Least-squares Projection for Screening Variables

Variable selection is a challenging issue in statistical applications wh...
research
10/03/2022

Factor-Augmented Regularized Model for Hazard Regression

A prevalent feature of high-dimensional data is the dependence among cov...
research
11/28/2017

Nonparametric Independence Screening via Favored Smoothing Bandwidth

We propose a flexible nonparametric regression method for ultrahigh-dime...
research
05/05/2023

On the use of ordered factors as explanatory variables

Consider a regression or some regression-type model for a certain respon...
research
12/14/2017

Fast robust correlation for high dimensional data

The product moment covariance is a cornerstone of multivariate data anal...
research
05/26/2022

Factor selection in screening experiments by aggregation over random models

Screening experiments are useful for screening out a small number of tru...

Please sign up or login with your details

Forgot password? Click here to reset