Evaluation of approaches for accommodating interactions and non-linear terms in multiple imputation of incomplete three-level data

10/30/2020
by Rushani Wijesuriya, et al.

Three-level data structures arising from repeated measures on individuals clustered within larger units are common in health research studies. Missing data are prominent in such studies and are often handled via multiple imputation (MI). Several MI approaches can account for the three-level structure, including adaptations of single- and two-level approaches. However, when the substantive analysis model includes interactions or quadratic effects, these too need to be accommodated in the imputation model. In such analyses, substantive model compatible (SMC) MI has shown great promise in the context of single-level data. While there have been recent developments in multilevel SMC MI, to date only one available approach explicitly handles incomplete three-level data. Alternatively, researchers can use pragmatic adaptations of single- and two-level MI approaches, or two-level SMC MI approaches. We describe the available approaches and evaluate them via simulation in the context of three three-level random effects analysis models involving an interaction between the incomplete time-varying exposure and time, an interaction between the time-varying exposure and an incomplete time-fixed confounder, or a quadratic effect of the exposure. Results showed that all approaches considered performed well in terms of bias and precision when the target analysis involved an interaction with time, but the three-level SMC MI approach performed best when the target analysis involved an interaction between the time-varying exposure and an incomplete time-fixed confounder, or a quadratic effect of the exposure. We illustrate the methods using data from the Childhood to Adolescence Transition Study.
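
As a rough illustration of the three target analyses named above, the models might be written as follows. The notation is an assumption made here for illustration and is not taken from the paper: y_{ijk} is the outcome for cluster i, individual j, and wave k; x_{ijk} is the incomplete time-varying exposure; t_{ijk} is time; c_{ij} is the incomplete time-fixed confounder; and the random effects are assumed to be simple random intercepts at the cluster and individual levels.

```latex
% Illustrative sketch only: symbols and the random-intercept structure are assumptions.
\begin{align*}
% (1) interaction between the time-varying exposure and time
y_{ijk} &= \beta_0 + \beta_1 x_{ijk} + \beta_2 t_{ijk} + \beta_3 x_{ijk} t_{ijk}
           + \beta_4 c_{ij} + u_i + v_{ij} + \varepsilon_{ijk} \\
% (2) interaction between the time-varying exposure and the time-fixed confounder
y_{ijk} &= \beta_0 + \beta_1 x_{ijk} + \beta_2 t_{ijk} + \beta_3 x_{ijk} c_{ij}
           + \beta_4 c_{ij} + u_i + v_{ij} + \varepsilon_{ijk} \\
% (3) quadratic effect of the exposure
y_{ijk} &= \beta_0 + \beta_1 x_{ijk} + \beta_2 t_{ijk} + \beta_3 x_{ijk}^2
           + \beta_4 c_{ij} + u_i + v_{ij} + \varepsilon_{ijk}
\end{align*}
% Assumed distributions for the cluster-level, individual-level, and residual terms
\[
u_i \sim N(0, \sigma_u^2), \qquad
v_{ij} \sim N(0, \sigma_v^2), \qquad
\varepsilon_{ijk} \sim N(0, \sigma_\varepsilon^2)
\]
```

Under this sketch, the term multiplied by \beta_3 is the non-linear or interaction term that the imputation model would need to accommodate in each case.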


research
04/13/2022

Infinite Hidden Markov Models for Multiple Multivariate Time Series with Missing Data

Exposure to air pollution is associated with increased morbidity and mor...
research
12/10/2021

Handling missing data when estimating causal effects with Targeted Maximum Likelihood Estimation

Causal inference from longitudinal studies is central to epidemiologic r...
research
01/27/2023

G-formula for causal inference via multiple imputation

G-formula is a popular approach for estimating treatment or exposure eff...
research
05/04/2021

Considerations for using reproduction data in toxicokinetic-toxicodynamic modelling

Toxicokinetic-toxicodynamic (TKTD) modelling is essential to make sense ...
research
07/15/2021

Optimal-Design Domain-Adaptation for Exposure Prediction in Two-Stage Epidemiological Studies

In the first stage of a two-stage study, the researcher uses a statistic...
research
08/02/2023

Model Selection for Exposure-Mediator Interaction

In mediation analysis, the exposure often influences the mediating effec...
research
08/25/2023

Generative Bayesian modeling to nowcast the effective reproduction number from line list data with missing symptom onset dates

The time-varying effective reproduction number R_t is a widely used indi...
