Solar: a least-angle regression for accurate and stable variable selection in high-dimensional data

07/30/2020
by Ning Xu, et al.

We propose a new least-angle regression algorithm for variable selection in high-dimensional data, called subsample-ordered least-angle regression (solar). Solar relies on the average L_0 solution path computed across subsamples and largely alleviates several known high-dimensional issues with least-angle regression. Using examples based on directed acyclic graphs, we illustrate the advantages of solar in comparison to least-angle regression, forward regression and variable screening. Simulations demonstrate that, with a similar computation load, solar yields substantial improvements over two lasso solvers (least-angle regression for lasso and coordinate-descent) in terms of the sparsity (37-64% reduction in the average number of selected variables), stability and accuracy of variable selection. Simulations also demonstrate that solar enhances the robustness of variable selection to different settings of the irrepresentable condition and to variations in the dependence structures assumed in regression analysis. We provide a Python package solarpy for the algorithm.
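As a rough illustration of the idea of averaging a solution path across subsamples, the sketch below averages a rank-based score for each variable over several subsample paths and keeps the variables whose average score clears a threshold. It is only a minimal sketch under stated assumptions: the function names `average_entry_scores` and `select_variables`, the linear weighting of entry steps, and the fixed threshold are all hypothetical and are not taken from the paper or from the solarpy package. The per-subsample entry orders are supplied as input here; in practice they would come from a path algorithm such as least-angle regression or forward regression run on each subsample.

```python
def average_entry_scores(entry_orders, n_vars):
    """Average a rank-based score for each variable across subsamples.

    entry_orders: one list per subsample, giving variable indices in the
    order they entered the path on that subsample (earliest first).
    Assumed weighting (hypothetical): a variable entering at 0-based step k
    scores (n_vars - k) / n_vars; a variable that never enters scores 0.
    """
    totals = [0.0] * n_vars
    for order in entry_orders:
        for step, var in enumerate(order):
            totals[var] += (n_vars - step) / n_vars
    return [total / len(entry_orders) for total in totals]


def select_variables(scores, threshold):
    """Return variable indices with score >= threshold, highest score first."""
    ranked = sorted(enumerate(scores), key=lambda pair: -pair[1])
    return [index for index, score in ranked if score >= threshold]


# Hypothetical entry orders from three subsamples of a 4-variable problem.
orders = [[0, 1, 2], [0, 2, 1], [1, 0, 2]]
scores = average_entry_scores(orders, n_vars=4)
selected = select_variables(scores, threshold=0.7)
```

Variables that enter early on most subsamples end up with a high average score, while a variable that enters early on only one subsample is pulled down by the others. This is one way such averaging could stabilize selection. In a real analysis the threshold would typically be tuned, for example on held-out data, rather than fixed by hand.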

research
07/30/2020

Accuracy and stability of solar variable selection comparison under complicated dependence structures

In this paper we focus on the variable-selection performance of solar on ...
research
02/07/2008

Least angle and ℓ_1 penalized regression: A review

Least Angle Regression is a promising technique for variable selection a...
research
12/21/2020

A critical review of LASSO and its derivatives for variable selection under dependence among covariates

We study the limitations of the well known LASSO regression as a variabl...
research
10/27/2022

Exhuming nonnegative garrote from oblivion using suitable initial estimates- illustration in low and high-dimensional real data

The nonnegative garrote (NNG) is among the first approaches that combine...
research
01/20/2023

Optimization of body configuration and joint-driven attitude stabilization for transformable spacecrafts under solar radiation pressure

A solar sail is one of the most promising space exploration systems becau...
research
04/29/2012

Optimality of Graphlet Screening in High Dimensional Variable Selection

Consider a linear regression model where the design matrix X has n rows ...
research
08/02/2018

High-dimensional regression in practice: an empirical study of finite-sample prediction, variable selection and ranking

Penalized likelihood methods are widely used for high-dimensional regres...
