Structured Sparse Regression via Greedy Hard-Thresholding

02/19/2016
by   Prateek Jain, et al.
0

Several learning applications require solving high-dimensional regression problems where the relevant features belong to a small number of (overlapping) groups. For very large datasets and under standard sparsity constraints, hard thresholding methods have proven to be extremely efficient, but such methods require NP hard projections when dealing with overlapping groups. In this paper, we show that such NP-hard projections can not only be avoided by appealing to submodular optimization, but such methods come with strong theoretical guarantees even in the presence of poorly conditioned data (i.e. say when two features have correlation ≥ 0.99), which existing analyses cannot handle. These methods exhibit an interesting computation-accuracy trade-off and can be extended to significantly harder problems such as sparse overlapping groups. Experiments on both real and synthetic data validate our claims and demonstrate that the proposed methods are orders of magnitude faster than other greedy and convex relaxation techniques for learning with group-structured sparsity.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/22/2013

Gradient Hard Thresholding Pursuit for Sparsity-Constrained Optimization

Hard Thresholding Pursuit (HTP) is an iterative greedy selection procedu...
research
02/18/2014

Classification with Sparse Overlapping Groups

Classification with a sparsity constraint on the solution plays a centra...
research
12/02/2019

Efficient Relaxed Gradient Support Pursuit for Sparsity Constrained Non-convex Optimization

Large-scale non-convex sparsity-constrained problems have recently gaine...
research
08/15/2012

Efficient Algorithm for Extremely Large Multi-task Regression with Massive Structured Sparsity

We develop a highly scalable optimization method called "hierarchical gr...
research
09/30/2016

Technical Report: Graph-Structured Sparse Optimization for Connected Subgraph Detection

Structured sparse optimization is an important and challenging problem f...
research
01/20/2020

Generalization Bounds for High-dimensional M-estimation under Sparsity Constraint

The ℓ_0-constrained empirical risk minimization (ℓ_0-ERM) is a promising...
research
07/12/2016

Information Projection and Approximate Inference for Structured Sparse Variables

Approximate inference via information projection has been recently intro...

Please sign up or login with your details

Forgot password? Click here to reset