Efficient Algorithm for Extremely Large Multi-task Regression with Massive Structured Sparsity

08/15/2012
by   Seunghak Lee, et al.
0

We develop a highly scalable optimization method called "hierarchical group-thresholding" for solving a multi-task regression model with complex structured sparsity constraints on both input and output spaces. Despite the recent emergence of several efficient optimization algorithms for tackling complex sparsity-inducing regularizers, true scalability in practical high-dimensional problems where a huge amount (e.g., millions) of sparsity patterns need to be enforced remains an open challenge, because all existing algorithms must deal with ALL such patterns exhaustively in every iteration, which is computationally prohibitive. Our proposed algorithm addresses the scalability problem by screening out multiple groups of coefficients simultaneously and systematically. We employ a hierarchical tree representation of group constraints to accelerate the process of removing irrelevant constraints by taking advantage of the inclusion relationships between group sparsities, thereby avoiding dealing with all constraints in every optimization step, and necessitating optimization operation only on a small number of outstanding coefficients. In our experiments, we demonstrate the efficiency of our method on simulation datasets, and in an application of detecting genetic variants associated with gene expression traits.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/09/2012

Structured Input-Output Lasso, with Application to eQTL Mapping, and a Thresholding Algorithm for Fast Estimation

We consider the problem of learning a high-dimensional multi-task regres...
research
09/08/2009

Tree-guided group lasso for multi-response regression with structured sparsity, with an application to eQTL mapping

We consider the problem of estimating a sparse multi-response regression...
research
02/14/2012

Smoothing Proximal Gradient Method for General Structured Sparse Learning

We study the problem of learning high dimensional regression models regu...
research
02/19/2016

Structured Sparse Regression via Greedy Hard-Thresholding

Several learning applications require solving high-dimensional regressio...
research
08/23/2023

Less is More – Towards parsimonious multi-task models using structured sparsity

Group sparsity in Machine Learning (ML) encourages simpler, more interpr...
research
05/07/2016

A Bayesian Group Sparse Multi-Task Regression Model for Imaging Genetics

Motivation: Recent advances in technology for brain imaging and high-thr...

Please sign up or login with your details

Forgot password? Click here to reset