Estimation, Confidence Intervals, and Large-Scale Hypotheses Testing for High-Dimensional Mixed Linear Regression

11/06/2020
by Linjun Zhang, et al.

This paper studies high-dimensional mixed linear regression (MLR), in which the output variable comes from one of two linear regression models, with an unknown mixing proportion and an unknown covariance structure for the random covariates. Building upon a high-dimensional EM algorithm, we propose an iterative procedure for estimating the two regression vectors and establish the estimators' rates of convergence. Based on the iterative estimators, we further construct debiased estimators and establish their asymptotic normality. For individual coordinates, we construct confidence intervals centered at the debiased estimators. Furthermore, we propose a large-scale multiple testing procedure for the regression coefficients and show that it controls the false discovery rate (FDR) asymptotically. Simulation studies examine the numerical performance of the proposed methods and demonstrate their superiority over existing methods. The proposed methods are further illustrated through an analysis of a multiplex image cytometry dataset, which investigates the interaction networks among cellular phenotypes that include the expression levels of 20 epitopes or combinations of markers.
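
For orientation, the two-component model described above can be written out as follows. This is only a sketch under assumed notation: the symbols \omega (mixing proportion), \beta_1 and \beta_2 (the two regression vectors), \Sigma (covariate covariance), and \varepsilon_i (noise) are not taken from the abstract, and the paper's own notation and noise assumptions may differ.

```latex
% Assumed notation, for illustration only:
%   x_i \in \mathbb{R}^p : random covariates with unknown covariance \Sigma
%   \omega \in (0, 1)    : unknown mixing proportion
%   \beta_1, \beta_2     : the two unknown regression vectors
%   \varepsilon_i        : noise term (distribution not specified here)
y_i =
\begin{cases}
  x_i^\top \beta_1 + \varepsilon_i, & \text{with probability } \omega, \\
  x_i^\top \beta_2 + \varepsilon_i, & \text{with probability } 1 - \omega,
\end{cases}
\qquad i = 1, \dots, n .
```

In this notation, the component that generated each observation is unobserved, which is why an EM-type iteration is a natural starting point for estimating \beta_1 and \beta_2.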

