Enumerating Multiple Equivalent Lasso Solutions

10/13/2017
by Yannis Pantazis, et al.

Predictive modelling is a data-analysis task common to many scientific fields. It is not widely appreciated, however, that multiple predictive models can perform equally well on the same problem. This multiplicity often leads to poor reproducibility when searching for a unique solution in datasets with a low number of samples, a high-dimensional feature space and/or high levels of noise, a common scenario in biology and medicine. Lasso regression is one of the most powerful and popular regularization methods, yet it too produces only a single, sparse solution. In this paper, we show that nearly optimal Lasso solutions exist whose out-of-sample statistical error is practically indistinguishable from that of the optimal one. We formalize various notions of equivalence between Lasso solutions, and we devise an algorithm to enumerate the ones that are equivalent in a statistical sense: we define a tolerance on the root mean square error (RMSE), which creates an RMSE-equivalent Lasso solution space. Results in both regression and classification tasks reveal that the additional out-of-sample error due to the RMSE relaxation is within the range of the statistical error due to the finite sample size.
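To make the idea of an RMSE tolerance concrete, here is a minimal sketch of how RMSE-equivalent solutions could be collected. It is not the authors' enumeration algorithm. It assumes a relative tolerance `tol` on a held-out RMSE, a plain coordinate-descent Lasso solver, and a naive search that refits after excluding each feature selected by the reference solution. The names `lasso_cd`, `rmse`, and `rmse_equivalent_solutions`, as well as the synthetic data, are hypothetical and chosen only for illustration.

```python
import numpy as np

def lasso_cd(X, y, lam, n_iter=200):
    """Coordinate descent for (1/2n)*||y - X b||^2 + lam*||b||_1."""
    n, p = X.shape
    b = np.zeros(p)
    col_sq = (X ** 2).sum(axis=0) / n
    r = y.astype(float).copy()          # residual y - X b (b starts at 0)
    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue
            r += X[:, j] * b[j]         # remove feature j's contribution
            rho = X[:, j] @ r / n
            b[j] = np.sign(rho) * max(abs(rho) - lam, 0.0) / col_sq[j]
            r -= X[:, j] * b[j]         # add the updated contribution back
    return b

def rmse(X, y, b):
    return float(np.sqrt(np.mean((y - X @ b) ** 2)))

def rmse_equivalent_solutions(X_tr, y_tr, X_val, y_val, lam, tol):
    """Reference Lasso fit plus leave-one-selected-feature-out refits whose
    validation RMSE is at most (1 + tol) times the reference RMSE.
    The relative form of the tolerance is an assumption of this sketch."""
    b_ref = lasso_cd(X_tr, y_tr, lam)
    ref = rmse(X_val, y_val, b_ref)
    kept = [b_ref]
    for j in np.flatnonzero(b_ref):
        mask = np.ones(X_tr.shape[1], dtype=bool)
        mask[j] = False                 # forbid feature j and refit
        b = np.zeros(X_tr.shape[1])
        b[mask] = lasso_cd(X_tr[:, mask], y_tr, lam)
        if rmse(X_val, y_val, b) <= (1 + tol) * ref:
            kept.append(b)
    return ref, kept

# Synthetic illustration: feature 1 is a near-copy of feature 0,
# so either one can stand in for the other.
rng = np.random.default_rng(0)
n = 400
x0 = rng.normal(size=n)
X = np.column_stack([x0, x0 + 0.01 * rng.normal(size=n), rng.normal(size=n)])
y = 2.0 * x0 + 0.1 * rng.normal(size=n)
X_tr, y_tr, X_val, y_val = X[:200], y[:200], X[200:], y[200:]

tol = 0.1
ref, kept = rmse_equivalent_solutions(X_tr, y_tr, X_val, y_val, lam=0.1, tol=tol)
for b in kept:
    print(np.flatnonzero(b), round(rmse(X_val, y_val, b), 4))
```

Each printed line shows the support of one retained solution and its validation RMSE. With two nearly collinear features, solutions built on different feature subsets are expected to fall inside the tolerance, which is the kind of multiplicity the abstract describes.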


research
05/27/2017

Generalized Concomitant Multi-Task Lasso for sparse multimodal regression

In high dimension, it is customary to consider Lasso-type estimators to ...
research
06/15/2018

Statistical Inference with Ensemble of Clustered Desparsified Lasso

Medical imaging involves high-dimensional data, yet their acquisition is...
research
10/14/2018

Convex Hull Approximation of Nearly Optimal Lasso Solutions

In an ordinary feature selection procedure, a set of important features ...
research
03/22/2016

Localized Lasso for High-Dimensional Regression

We introduce the localized Lasso, which is suited for learning models th...
research
12/14/2022

On LASSO for High Dimensional Predictive Regression

In a high dimensional linear predictive regression where the number of p...
research
05/11/2016

Asymptotic equivalence of regularization methods in thresholded parameter space

High-dimensional data analysis has motivated a spectrum of regularizatio...
research
09/28/2017

Sparse High-Dimensional Regression: Exact Scalable Algorithms and Phase Transitions

We present a novel binary convex reformulation of the sparse regression ...
