Regression shrinkage and grouping of highly correlated predictors with HORSES

02/01/2013
by   Woncheol Jang, et al.
0

Identifying homogeneous subgroups of variables can be challenging in high dimensional data analysis with highly correlated predictors. We propose a new method called Hexagonal Operator for Regression with Shrinkage and Equality Selection, HORSES for short, that simultaneously selects positively correlated variables and identifies them as predictive clusters. This is achieved via a constrained least-squares problem with regularization that consists of a linear combination of an L_1 penalty for the coefficients and another L_1 penalty for pairwise differences of the coefficients. This specification of the penalty function encourages grouping of positively correlated predictors combined with a sparsity solution. We construct an efficient algorithm to implement the HORSES procedure. We show via simulation that the proposed method outperforms other variable selection methods in terms of prediction error and parsimony. The technique is demonstrated on two data sets, a small data set from analysis of soil in Appalachia, and a high dimensional data set from a near infrared (NIR) spectroscopy study, showing the flexibility of the methodology.

READ FULL TEXT

page 15

page 17

research
06/09/2023

Variable screening using factor analysis for high-dimensional data with multicollinearity

Screening methods are useful tools for variable selection in regression ...
research
01/16/2019

Smooth Adjustment for Correlated Effects

This paper considers a high dimensional linear regression model with cor...
research
06/10/2020

Robust Grouped Variable Selection Using Distributionally Robust Optimization

We propose a Distributionally Robust Optimization (DRO) formulation with...
research
05/11/2020

Ensembled sparse-input hierarchical networks for high-dimensional datasets

Neural networks have seen limited use in prediction for high-dimensional...
research
07/10/2017

An Interactive Greedy Approach to Group Sparsity in High Dimension

Sparsity learning with known grouping structures has received considerab...
research
06/25/2019

Simultaneous Variable Selection, Clustering, and Smoothing in Function on Scalar Regression

We address the problem of multicollinearity in a function-on-scalar regr...
research
08/28/2013

Compound Poisson Processes, Latent Shrinkage Priors and Bayesian Nonconvex Penalization

In this paper we discuss Bayesian nonconvex penalization for sparse lear...

Please sign up or login with your details

Forgot password? Click here to reset