Variance estimation in pseudo-expected estimating equations for missing data

04/21/2022
by Giorgos Bakoyannis, et al.

Missing data is a common challenge in biomedical research. This fact, along with the growing dataset volumes of the modern era, makes computationally efficient analysis with missing data an issue of crucial practical importance. A general computationally efficient estimation framework for dealing with missing data is the pseudo-expected estimating equations (PEEE) approach. The method is applicable to any parametric model for which estimation involves solving a set of estimating equations, such as likelihood score equations. A key limitation of the PEEE approach is that there is currently no closed-form variance estimator, so variance estimation requires the computationally burdensome bootstrap method. In this work, we address this gap and provide a closed-form variance estimator whose computation can be significantly faster than a bootstrap approach. Our variance estimator is shown to be consistent even with auxiliary variables and under misspecified models for the incomplete variables. Simulation studies show that our variance estimator performs well and that its computation can be over 50 times faster than the bootstrap. The computational efficiency gain from our proposed variance estimator is crucial with large datasets or when the main analysis method is computationally intensive. Finally, the PEEE approach, together with our variance estimator, is used to analyze incomplete electronic health record data of patients with traumatic brain injury.
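To make the contrast between a closed-form variance estimator and the bootstrap concrete, the following is a minimal sketch on complete data. It is not the PEEE estimator or the variance estimator proposed in the paper; it only illustrates the generic idea for a standard set of estimating equations (logistic score equations), using the textbook sandwich form A^{-1} B A^{-T}. The simulated data, the function names, and the number of bootstrap replicates are all assumptions made for illustration.

```python
import numpy as np

def solve_logistic_score(X, y, n_iter=25):
    """Solve the logistic score equations sum_i x_i (y_i - p_i) = 0 by Newton-Raphson."""
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-X @ beta))
        score = X.T @ (y - p)
        info = (X * (p * (1.0 - p))[:, None]).T @ X
        beta = beta + np.linalg.solve(info, score)
    return beta

def sandwich_variance(X, y, beta):
    """Closed-form sandwich estimate A^{-1} B A^{-T} of Var(beta_hat)."""
    p = 1.0 / (1.0 + np.exp(-X @ beta))
    A = (X * (p * (1.0 - p))[:, None]).T @ X   # minus the derivative of the summed score
    U = X * (y - p)[:, None]                   # per-observation score contributions
    B = U.T @ U                                # sum of outer products of the contributions
    A_inv = np.linalg.inv(A)
    return A_inv @ B @ A_inv.T

def bootstrap_variance(X, y, n_boot, rng):
    """Nonparametric bootstrap: re-solve the estimating equations on each resample."""
    n = len(y)
    draws = np.empty((n_boot, X.shape[1]))
    for b in range(n_boot):
        idx = rng.integers(0, n, size=n)
        draws[b] = solve_logistic_score(X[idx], y[idx])
    return np.cov(draws, rowvar=False)

# Hypothetical simulated data (illustration only, no missingness).
rng = np.random.default_rng(0)
n = 2000
X = np.column_stack([np.ones(n), rng.normal(size=n)])
true_beta = np.array([-0.5, 1.0])
y = rng.binomial(1, 1.0 / (1.0 + np.exp(-X @ true_beta))).astype(float)

beta_hat = solve_logistic_score(X, y)
se_sandwich = np.sqrt(np.diag(sandwich_variance(X, y, beta_hat)))
se_boot = np.sqrt(np.diag(bootstrap_variance(X, y, 200, rng)))
print("estimate:", beta_hat)
print("sandwich SE:", se_sandwich)
print("bootstrap SE:", se_boot)
```

The closed-form route solves the estimating equations once and then needs only a few matrix products, whereas the bootstrap re-solves them for every resample. That difference in the number of model fits is the kind of cost gap the abstract refers to, although the speedup in this toy setting says nothing about the figures reported in the paper.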
