Better subset regression

12/04/2012
by   Shifeng Xiong, et al.
0

To find efficient screening methods for high dimensional linear regression models, this paper studies the relationship between model fitting and screening performance. Under a sparsity assumption, we show that a subset that includes the true submodel always yields smaller residual sum of squares (i.e., has better model fitting) than all that do not in a general asymptotic setting. This indicates that, for screening important variables, we could follow a "better fitting, better screening" rule, i.e., pick a "better" subset that has better model fitting. To seek such a better subset, we consider the optimization problem associated with best subset regression. An EM algorithm, called orthogonalizing subset screening, and its accelerating version are proposed for searching for the best subset. Although the two algorithms cannot guarantee that a subset they yield is the best, their monotonicity property makes the subset have better model fitting than initial subsets generated by popular screening methods, and thus the subset can have better screening performance asymptotically. Simulation results show that our methods are very competitive in high dimensional variable screening even for finite sample sizes.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/07/2014

Faithful Variable Screening for High-Dimensional Convex Regression

We study the problem of variable selection in convex nonparametric regre...
research
12/05/2011

On best subset regression

In this paper we discuss the variable selection method from ℓ0-norm cons...
research
04/20/2021

Screening methods for linear errors-in-variables models in high dimensions

Microarray studies, in order to identify genes associated with an outcom...
research
07/21/2021

Bayesian iterative screening in ultra-high dimensional settings

Variable selection in ultra-high dimensional linear regression is often ...
research
07/03/2020

When is best subset selection the "best"?

Best subset selection (BSS) is fundamental in statistics and machine lea...
research
02/10/2019

Iterative Least Trimmed Squares for Mixed Linear Regression

Given a linear regression setting, Iterative Least Trimmed Squares (ILTS...
research
09/14/2018

Are screening methods useful in feature selection? An empirical study

Filter or screening methods are often used as a preprocessing step for r...

Please sign up or login with your details

Forgot password? Click here to reset