Sparse High-Dimensional Regression: Exact Scalable Algorithms and Phase Transitions

09/28/2017
by Dimitris Bertsimas, et al.

We present a novel binary convex reformulation of the sparse regression problem that constitutes a new duality perspective. We devise a new cutting plane method and provide evidence that it can solve the sparse regression problem to provable optimality, in seconds, for sample sizes n and numbers of regressors p in the 100,000s, which is two orders of magnitude better than the current state of the art. The ability to solve the problem in very high dimensions allows us to observe new phase transition phenomena. Contrary to traditional complexity theory, which suggests that the difficulty of a problem increases with problem size, the sparse regression problem becomes easier as the number of samples n increases: the solution recovers 100% of the true signal, and our approach solves the problem extremely fast (in fact faster than Lasso). For a small number of samples n, our approach takes longer to solve the problem, but importantly the optimal solution provides a statistically more relevant regressor. We argue that our exact sparse regression approach presents a superior alternative to the heuristic methods available at present.
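
The abstract does not spell out the formulation, so the following is only a minimal sketch under stated assumptions. It assumes the sparse regression problem is the ridge-regularized least squares problem with at most k nonzero coefficients, and that the binary reformulation assigns each support s the cost c(s) = 0.5 * y^T (I + gamma * X_s X_s^T)^{-1} y, which equals the optimal ridge objective on that support by the Woodbury identity. The names `dual_cost`, `ridge_cost`, `best_subset`, and the parameter `gamma` are hypothetical, and exhaustive enumeration stands in for the cutting plane method, which is not reproduced here.

```python
import itertools
import numpy as np


def dual_cost(X, y, support, gamma):
    """Assumed cost c(s) = 0.5 * y^T (I + gamma * X_s X_s^T)^{-1} y."""
    Xs = X[:, list(support)]
    n = X.shape[0]
    K = np.eye(n) + gamma * Xs @ Xs.T
    return 0.5 * y @ np.linalg.solve(K, y)


def ridge_cost(X, y, support, gamma):
    """Same value computed directly as the optimal ridge objective:
    min_w 0.5 * ||y - X_s w||^2 + ||w||^2 / (2 * gamma)."""
    Xs = X[:, list(support)]
    k = Xs.shape[1]
    w = np.linalg.solve(Xs.T @ Xs + np.eye(k) / gamma, Xs.T @ y)
    r = y - Xs @ w
    return 0.5 * r @ r + w @ w / (2.0 * gamma)


def best_subset(X, y, k, gamma):
    """Brute-force search over all supports of size k (tiny p only).
    This replaces the cutting plane method purely for illustration."""
    p = X.shape[1]
    return min(itertools.combinations(range(p), k),
               key=lambda s: dual_cost(X, y, s, gamma))


# Synthetic, noise-free data with a known 2-sparse signal (illustrative values).
rng = np.random.default_rng(0)
n, p, k, gamma = 50, 8, 2, 100.0
X = rng.standard_normal((n, p))
w_true = np.zeros(p)
w_true[[1, 4]] = [2.0, -3.0]
y = X @ w_true

support = best_subset(X, y, k, gamma)
print("selected support:", support)
print("dual cost :", dual_cost(X, y, support, gamma))
print("ridge cost:", ridge_cost(X, y, support, gamma))
```

Enumeration examines every one of the "p choose k" supports, so it is only usable for very small p; it is meant to show what an exact method optimizes over, not how to scale it.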

research
09/28/2017

Sparse Hierarchical Regression with Polynomials

We present a novel method for exact hierarchical sparse polynomial regre...
research
04/01/2022

On Distributed Exact Sparse Linear Regression over Networks

In this work, we propose an algorithm for solving exact sparse linear re...
research
09/11/2018

A new exact algorithm for solving single machine scheduling problems with learning effects and deteriorating jobs

In this paper, the single machine scheduling problem with deteriorating ...
research
04/13/2023

OKRidge: Scalable Optimal k-Sparse Ridge Regression for Learning Dynamical Systems

We consider an important problem in scientific discovery, identifying sp...
research
10/30/2019

Iterative Hessian Sketch in Input Sparsity Time

Scalable algorithms to solve optimization and regression tasks even appr...
research
10/13/2017

Enumerating Multiple Equivalent Lasso Solutions

Predictive modelling is a data-analysis task common in many scientific f...
research
06/11/2020

The Backbone Method for Ultra-High Dimensional Sparse Machine Learning

We present the backbone method, a generic framework that enables sparse ...
