Supervised dimensionality reduction for multiple imputation by chained equations

09/04/2023
by Edoardo Costantini, et al.

Multivariate imputation by chained equations (MICE) is one of the most popular approaches to address missing values in a data set. This approach requires specifying a univariate imputation model for every variable under imputation. Deciding which predictors to include in these univariate imputation models can be a daunting task. Principal component analysis (PCA) can simplify this process by replacing all of the potential imputation model predictors with a few components summarizing their variance. In this article, we extend the use of PCA with MICE to include a supervised aspect whereby information from the variables under imputation is incorporated into the principal component estimation. We conducted an extensive simulation study to assess the statistical properties of MICE with different versions of supervised dimensionality reduction, and we compared them with classical unsupervised PCA, a simpler dimensionality reduction technique.
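To make the idea concrete, the following is a minimal sketch of one generic form of supervised dimensionality reduction inside a single univariate imputation step. It is not taken from the article, whose abstract does not specify the variants studied. The sketch assumes a screening-based approach: predictors are ranked by their absolute correlation with the variable under imputation, principal components are computed from the top-ranked predictors only, and a linear model on those components generates the imputations. The function name `supervised_pc_impute` and the parameters `n_keep` and `n_comp` are hypothetical, and the predictors are assumed to be fully observed.

```python
import numpy as np

def supervised_pc_impute(y, X, n_keep=5, n_comp=2, rng=None):
    """Impute NaNs in y from PCs of the predictors most correlated with y.

    Illustrative sketch only (assumed screening-based supervision);
    X is assumed to contain no missing values.
    """
    rng = np.random.default_rng(rng)
    obs = ~np.isnan(y)

    # Standardize predictors so that PCA is not driven by scale.
    Xs = (X - X.mean(axis=0)) / X.std(axis=0)

    # Supervision step: absolute correlation of each predictor with y,
    # computed on the rows where y is observed.
    yc = y[obs] - y[obs].mean()
    Xo = Xs[obs] - Xs[obs].mean(axis=0)
    cors = np.abs(Xo.T @ yc) / (np.linalg.norm(Xo, axis=0) * np.linalg.norm(yc))
    keep = np.argsort(cors)[::-1][:n_keep]

    # PCA (via SVD) on the retained predictors only.
    Xk = Xs[:, keep]
    _, _, Vt = np.linalg.svd(Xk, full_matrices=False)
    scores = Xk @ Vt[:n_comp].T

    # Linear imputation model: y regressed on the component scores.
    Z = np.column_stack([np.ones(len(y)), scores])
    beta, *_ = np.linalg.lstsq(Z[obs], y[obs], rcond=None)
    resid = y[obs] - Z[obs] @ beta
    sigma = np.sqrt(resid @ resid / (obs.sum() - Z.shape[1]))

    # Fill missing entries with predictions plus residual noise.
    # Simplification: a proper multiple-imputation draw would also
    # sample beta and sigma from their posterior distribution.
    y_imp = y.copy()
    n_mis = int((~obs).sum())
    y_imp[~obs] = Z[~obs] @ beta + rng.normal(0.0, sigma, size=n_mis)
    return y_imp, keep
```

Setting `n_keep` to the total number of predictors removes the screening step, so the components no longer depend on `y`; this corresponds to the unsupervised PCA baseline the abstract mentions. In a full chained-equations run, a step like this would be repeated for each incomplete variable and iterated over several cycles.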

research
05/10/2023

Blockwise Principal Component Analysis for monotone missing data imputation and dimensionality reduction

Monotone missing data is a common problem in data analysis. However, imp...
research
06/30/2022

Solving the "many variables" problem in MICE with principal component regression

Multiple Imputation (MI) is one of the most popular approaches to addres...
research
07/02/2017

Dimensionality reduction with missing values imputation

In this study, we propose a new statical approach for high-dimensionalit...
research
05/06/2020

Stochastic Bottleneck: Rateless Auto-Encoder for Flexible Dimensionality Reduction

We propose a new concept of rateless auto-encoders (RL-AEs) that enable ...
research
08/29/2022

High-dimensional imputation for the social sciences: a comparison of state-of-the-art methods

Including a large number of predictors in the imputation model underlyin...
research
12/09/2020

Spatial noise-aware temperature retrieval from infrared sounder data

In this paper we present a combined strategy for the retrieval of atmosp...
research
07/15/2023

Supervised Dynamic PCA: Linear Dynamic Forecasting with Many Predictors

This paper proposes a novel dynamic forecasting method using a new super...
