Parallel integrative learning for large-scale multi-response regression with incomplete outcomes

04/11/2021
by   Ruipeng Dong, et al.
0

Multi-task learning is increasingly used to investigate the association structure between multiple responses and a single set of predictor variables in many applications. In the era of big data, the coexistence of incomplete outcomes, large number of responses, and high dimensionality in predictors poses unprecedented challenges in estimation, prediction, and computation. In this paper, we propose a scalable and computationally efficient procedure, called PEER, for large-scale multi-response regression with incomplete outcomes, where both the numbers of responses and predictors can be high-dimensional. Motivated by sparse factor regression, we convert the multi-response regression into a set of univariate-response regressions, which can be efficiently implemented in parallel. Under some mild regularity conditions, we show that PEER enjoys nice sampling properties including consistency in estimation, prediction, and variable selection. Extensive simulation studies show that our proposal compares favorably with several existing methods in estimation accuracy, variable selection, and computation efficiency.

READ FULL TEXT
research
01/14/2021

Structured Bayesian variable selection for multiple related response variables and high-dimensional predictors

It is becoming increasingly common to study the complex association betw...
research
05/11/2016

Interaction pursuit in high-dimensional multi-response regression via distance correlation

Feature interactions can contribute to a large proportion of variation i...
research
11/08/2018

A global-local approach for detecting hotspots in multiple-response regression

We tackle modelling and inference for variable selection in regression p...
research
05/02/2023

Slow Kill for Big Data Learning

Big-data applications often involve a vast number of observations and fe...
research
02/18/2021

A Generative Approach to Joint Modeling of Quantitative and Qualitative Responses

In many scientific areas, data with quantitative and qualitative (QQ) re...
research
02/11/2020

Computationally efficient univariate filtering for massive data

The vast availability of large scale, massive and big data has increased...
research
07/24/2019

Comparison of Multi-response Estimation Methods

Prediction performance does not always reflect the estimation behaviour ...

Please sign up or login with your details

Forgot password? Click here to reset