A logic-based resampling with matching approach to multiple imputation of missing data

04/14/2020
by Chinchin Wang, et al.

Researchers often use model-based multiple imputation to handle data that are missing at random, aiming to minimize bias while making the best use of all available data. However, in some contexts constraints among variables make it very difficult to fit a model, and a generic regression imputation model may produce implausible values. We explore the advantages of a logic-based resampling with matching (RWM) approach to multiple imputation. This approach is similar to random hot deck imputation and allows for more plausible imputations than model-based approaches. We illustrate an RWM approach for multiply imputing missing pain, activity frequency, and sport data using the Childhood Health, Activity, and Motor Performance School Study Denmark (CHAMPS-DK). We match each record with missing data to several observed records, estimate a probability for each matched record from the observed data, and sample from the matched records according to those probabilities. Because imputed values are generated randomly, multiple complete datasets can be created. Each dataset is then analyzed and the results are averaged in the same way as in model-based multiple imputation. This approach can be extended to other datasets as an alternative to model-based approaches, particularly where there are time-dependent ordered categorical variables or other constraints between variables.
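The match, weight, and sample steps described in the abstract can be sketched in a few lines of Python. This is only an illustrative sketch under simplifying assumptions, not the authors' implementation: the function name `rwm_impute`, the field names (`grade`, `sport`, `sessions`), the toy records, and the exact-match rule on `match_keys` are all hypothetical, and the probability of each candidate value is taken to be its observed frequency among the matched donor records.

```python
import random
from collections import Counter


def rwm_impute(records, target, match_keys, n_imputations=5, seed=0):
    """Return n_imputations completed copies of records (hypothetical helper).

    A missing value (None) in `target` is filled by sampling from the observed
    values of donor records that agree with it on every key in `match_keys`.
    """
    rng = random.Random(seed)
    completed = []
    for _ in range(n_imputations):
        dataset = []
        for rec in records:
            rec = dict(rec)  # copy so the original records stay untouched
            if rec[target] is None:
                # Step 1: match to observed records with the same key values.
                donors = [
                    d[target]
                    for d in records
                    if d[target] is not None
                    and all(d[k] == rec[k] for k in match_keys)
                ]
                if donors:  # leave the value missing if nothing matches
                    # Step 2: probability of each value = its share of donors.
                    counts = Counter(donors)
                    values = list(counts)
                    weights = [counts[v] / len(donors) for v in values]
                    # Step 3: sample one value according to those probabilities.
                    rec[target] = rng.choices(values, weights=weights, k=1)[0]
            dataset.append(rec)
        completed.append(dataset)
    return completed


# Toy data, invented for illustration only (not CHAMPS-DK values).
records = [
    {"grade": 3, "sport": "soccer", "sessions": 2},
    {"grade": 3, "sport": "soccer", "sessions": 2},
    {"grade": 3, "sport": "soccer", "sessions": 1},
    {"grade": 3, "sport": "soccer", "sessions": None},
    {"grade": 4, "sport": "handball", "sessions": 3},
    {"grade": 4, "sport": "handball", "sessions": None},
]

imputed = rwm_impute(records, "sessions", ["grade", "sport"], n_imputations=5)

# Analyze each completed dataset, then average the estimates across datasets.
means = [sum(r["sessions"] for r in ds) / len(ds) for ds in imputed]
pooled_mean = sum(means) / len(means)
```

Because every imputed value is drawn from records that were actually observed, an imputation cannot fall outside the combinations seen in the data, which is the sense in which a resampling approach avoids implausible values. A real application would likely need looser matching rules and logic checks between variables, which this sketch omits.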

research
03/02/2021

Multiple imputation with missing data indicators

Multiple imputation is a well-established general technique for analyzin...
research
03/29/2019

Statistical matching of non-Gaussian data

The statistical matching problem is a data integration problem with stru...
research
04/14/2018

Simultaneous Edit and Imputation for Household Data with Structural Zeros

Multivariate categorical data nested within households often include rep...
research
11/11/2020

Multiple Imputation for Nonignorable Item Nonresponse in Complex Surveys Using Auxiliary Margin

We outline a framework for multiple imputation of nonignorable item nonr...
research
05/04/2018

Population-calibrated multiple imputation for a binary/categorical covariate in categorical regression models

Multiple imputation (MI) has become popular for analyses with missing da...
research
09/24/2020

MatchThem:: Matching and Weighting after Multiple Imputation

Balancing the distributions of the confounders across the exposure level...
