Scalable Interpretable Learning for Multi-Response Error-in-Variables Regression

05/11/2020
by J. Wu, et al.

Corrupted data sets containing noisy or missing observations are prevalent in contemporary applications such as economics, finance and bioinformatics. Despite recent methodological and algorithmic advances in high-dimensional multi-response regression, it remains unclear how to achieve scalable and interpretable estimation when the covariates are contaminated. In this paper, we develop a new methodology called convex conditioned sequential sparse learning (COSS) for error-in-variables multi-response regression under both additive measurement errors and random missing data. It combines the strengths of the recently developed sequential sparse factor regression and the nearest positive semi-definite matrix projection, and thus enjoys stepwise convexity and scalability in large-scale association analyses. We provide comprehensive theoretical guarantees and demonstrate the effectiveness of the proposed methodology through numerical studies.
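
To make the role of the nearest positive semi-definite projection concrete, the following is a minimal sketch for the additive measurement error case. The notation is an assumption for illustration and does not come from the abstract: Z = X + A is the observed n-by-p design, the rows of A are mean-zero, independent of X, and have a known covariance \Sigma_A. The element-wise maximum norm used in the projection is likewise an assumption; it is one choice that appears in the convex conditioned literature, and the abstract does not state which norm COSS uses.

```latex
% Assumed setup (illustrative, not taken from the abstract):
%   Z = X + A, with A mean-zero, independent of X, row covariance \Sigma_A known.
% Unbiased surrogate for the Gram matrix of the clean covariates:
\[
  \widehat{\Sigma} \;=\; \frac{1}{n} Z^{\top} Z \;-\; \Sigma_A ,
  \qquad
  \mathbb{E}\bigl[\widehat{\Sigma}\bigr] \;=\; \frac{1}{n}\,\mathbb{E}\bigl[X^{\top} X\bigr].
\]
% When p > n, Z^T Z / n has rank at most n, so subtracting a positive definite
% \Sigma_A leaves \widehat{\Sigma} with negative eigenvalues, and a least-squares
% loss built on it is no longer convex.
% Nearest positive semi-definite projection (norm choice is an assumption):
\[
  \widetilde{\Sigma} \;=\; \operatorname*{arg\,min}_{K \succeq 0}
  \;\bigl\lVert K - \widehat{\Sigma} \bigr\rVert_{\max},
  \qquad
  \lVert M \rVert_{\max} \;=\; \max_{j,k}\, \lvert M_{jk} \rvert .
\]
```

Replacing \widehat{\Sigma} by \widetilde{\Sigma} in each quadratic loss restores convexity, which is how this sketch reads the "stepwise convexity" mentioned above.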
