The Backbone Method for Ultra-High Dimensional Sparse Machine Learning

06/11/2020
by   Dimitris Bertsimas, et al.
0

We present the backbone method, a generic framework that enables sparse and interpretable supervised machine learning methods to scale to ultra-high dimensional problems. We solve, in minutes, sparse regression problems with p∼10^7 features and decision tree induction problems with p∼10^5 features. The proposed method operates in two phases; we first determine the backbone set, that consists of potentially relevant features, by solving a number of tractable subproblems; then, we solve a reduced problem, considering only the backbone features. Numerical experiments demonstrate that our method competes with optimal solutions, when exact methods apply, and substantially outperforms baseline heuristics, when exact methods do not scale, both in terms of recovering the true relevant features and in its out-of-sample predictive performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/27/2014

Large-scale Online Feature Selection for Ultra-high Dimensional Sparse Data

Feature selection with large-scale high-dimensional data is important ye...
research
04/17/2020

Sparse Regression at Scale: Branch-and-Bound rooted in First-Order Optimization

We consider the least squares regression problem, penalized with a combi...
research
02/07/2022

Effects of Parametric and Non-Parametric Methods on High Dimensional Sparse Matrix Representations

The semantics are derived from textual data that provide representations...
research
09/28/2017

Sparse Hierarchical Regression with Polynomials

We present a novel method for exact hierarchical sparse polynomial regre...
research
08/14/2016

Ultra High-Dimensional Nonlinear Feature Selection for Big Biological Data

Machine learning methods are used to discover complex nonlinear relation...
research
09/28/2017

Sparse High-Dimensional Regression: Exact Scalable Algorithms and Phase Transitions

We present a novel binary convex reformulation of the sparse regression ...
research
06/23/2020

SWAG: A Wrapper Method for Sparse Learning

Predictive power has always been the main research focus of learning alg...

Please sign up or login with your details

Forgot password? Click here to reset