Envelope Methods with Ignorable Missing Data

03/24/2021
by   Linquan Ma, et al.
0

Envelope method was recently proposed as a method to reduce the dimension of responses in multivariate regressions. However, when there exists missing data, the envelope method using the complete case observations may lead to biased and inefficient results. In this paper, we generalize the envelope estimation when the predictors and/or the responses are missing at random. Specifically, we incorporate the envelope structure in the expectation-maximization (EM) algorithm. As the parameters under the envelope method are not pointwise identifiable, the EM algorithm for the envelope method was not straightforward and requires a special decomposition. Our method is guaranteed to be more efficient, or at least as efficient as, the standard EM algorithm. Moreover, our method has the potential to outperform the full data MLE. We give asymptotic properties of our method under both normal and non-normal cases. The efficiency gain over the standard EM is confirmed in simulation studies and in an application to the Chronic Renal Insufficiency Cohort (CRIC) study.

READ FULL TEXT

page 22

page 24

page 25

research
01/28/2022

A Robust and Flexible EM Algorithm for Mixtures of Elliptical Distributions with Missing Data

This paper tackles the problem of missing data imputation for noisy and ...
research
11/13/2021

A Hybrid EM Algorithm for Linear Two-Way Interactions with Missing Data

We study an EM algorithm for estimating product-term regression models w...
research
03/11/2021

Likelihood-based missing data analysis in multivariate crossover trials

For gene expression data measured in a crossover trial, a multivariate m...
research
01/14/2022

Estimating Gaussian Copulas with Missing Data

In this work we present a rigorous application of the Expectation Maximi...
research
04/11/2020

Handling missing data in a neural network approach for the identification of charged particles in a multilayer detector

Identification of charged particles in a multilayer detector by the ener...
research
10/26/2020

The More Data, the Better? Demystifying Deletion-Based Methods in Linear Regression with Missing Data

We compare two deletion-based methods for dealing with the problem of mi...
research
02/28/2019

Learning partially ranked data based on graph regularization

Ranked data appear in many different applications, including voting and ...

Please sign up or login with your details

Forgot password? Click here to reset