A flexible and efficient algorithm for joint imputation of general data

08/05/2020
by   Michael W. Robbins, et al.
0

Imputation of data with general structures (e.g., data with continuous, binary, unordered categorical, and ordinal variables) is commonly performed with fully conditional specification (FCS) instead of joint modeling. A key drawback of FCS is that it does not invoke an appropriate data augmentation mechanism and as such convergence of the resulting Markov chain Monte Carlo procedure is not assured. Methods that use joint modeling lack these drawbacks but have not been efficiently implemented in data of general structures. We address these issues by developing a new method, coherent multivariate imputation (CMI), that draws imputations from a latent joint multivariate normal model that underpins the generally structured data. This model is constructed using a sequence of flexible conditional linear models that enables the resulting procedure to be efficiently implemented on high dimensional datasets in practice. Simulations show that CMI performs well when compared to those that utilize FCS. Furthermore, the new method is dramatically more computationally efficient than FCS procedures.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/16/2022

Semiparametric imputation using latent sparse conditional Gaussian mixtures for multivariate mixed outcomes

This paper proposes a flexible Bayesian approach to multiple imputation ...
research
05/13/2022

Semiparametric Gaussian Copula Regression modeling for Mixed Data Types (SGCRM)

Many clinical and epidemiological studies encode collected participant-l...
research
12/04/2022

Convergence Analysis of Data Augmentation Algorithms for Bayesian Robust Multivariate Linear Regression with Incomplete Data

Gaussian mixtures are commonly used for modeling heavy-tailed error dist...
research
12/16/2021

Sensitivity Analysis of the MCRF Model to Different Transiogram Joint Modeling Methods for Simulating Categorical Spatial Variables

Markov chain geostatistics is a methodology for simulating categorical f...
research
08/27/2022

Joint distribution properties of Fully Conditional Specification under the normal linear model with normal inverse-gamma priors

Fully conditional specification (FCS) is a convenient and flexible multi...
research
01/12/2018

Multiple Imputation: A Review of Practical and Theoretical Findings

Multiple imputation is a straightforward method for handling missing dat...
research
03/29/2019

Statistical matching of non-Gaussian data

The statistical matching problem is a data integration problem with stru...

Please sign up or login with your details

Forgot password? Click here to reset