The correlation-assisted missing data estimator

11/05/2019
by   Timothy I. Cannings, et al.

We introduce a novel approach to estimation problems in settings with missing data. Our proposal, the Correlation-Assisted Missing data (CAM) estimator, exploits the relationship between the observations with missing features and those without missing features in order to obtain improved prediction accuracy. In particular, our theoretical results elucidate general conditions under which the proposed CAM estimator has lower mean squared error than the widely used complete-case approach in a range of estimation problems. We show in detail how the CAM estimator can be applied to U-statistics to obtain an unbiased, asymptotically Gaussian estimator that has lower variance than the complete-case U-statistic. Further, in nonparametric density estimation and regression problems, we construct our CAM estimator using kernel functions and show that it has lower asymptotic mean squared error than the corresponding complete-case kernel estimator. We also include practical demonstrations using the Terneuzen birth cohort and Brandsma datasets available from CRAN. Finally, our proposal is shown to outperform popular imputation methods in a simulation study.
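The abstract does not spell out the CAM construction, so the following is only a rough illustration of the general idea it describes: using observations with a missing feature to improve on the complete-case approach. It is a generic control-variate-style sketch for estimating a mean, not the estimator from the paper. The function names, the bivariate Gaussian data, and the assumption that `y` is missing completely at random while `x` is always observed are all hypothetical choices made for this example.

```python
import numpy as np


def complete_case_mean(y, observed):
    """Mean of y using only the rows where y is observed."""
    return y[observed].mean()


def correlation_adjusted_mean(x, y, observed):
    """Complete-case mean of y, corrected using the always-observed x.

    Illustrative control-variate adjustment (an assumption of this sketch,
    not the paper's CAM estimator): the gap between the complete-case mean
    of x and the full-sample mean of x is scaled by the estimated slope of
    y on x and subtracted from the complete-case mean of y.
    """
    x_cc, y_cc = x[observed], y[observed]
    cov = np.cov(x_cc, y_cc, ddof=1)      # 2x2 sample covariance matrix
    slope = cov[0, 1] / cov[0, 0]         # estimated slope of y on x
    return y_cc.mean() - slope * (x_cc.mean() - x.mean())


def simulate(n_reps=2000, n=200, p_obs=0.5, rho=0.8, seed=0):
    """Monte Carlo mean squared error of both estimators (true mean is 0)."""
    rng = np.random.default_rng(seed)
    err_cc, err_adj = [], []
    for _ in range(n_reps):
        x = rng.standard_normal(n)
        y = rho * x + np.sqrt(1.0 - rho**2) * rng.standard_normal(n)
        observed = rng.random(n) < p_obs  # y missing completely at random
        err_cc.append(complete_case_mean(y, observed))
        err_adj.append(correlation_adjusted_mean(x, y, observed))
    return np.mean(np.square(err_cc)), np.mean(np.square(err_adj))


if __name__ == "__main__":
    mse_cc, mse_adj = simulate()
    print(f"complete-case MSE:        {mse_cc:.5f}")
    print(f"correlation-adjusted MSE: {mse_adj:.5f}")
```

In this toy setting the adjusted estimator should show a smaller Monte Carlo mean squared error whenever `x` and `y` are strongly correlated, which is the kind of gain over the complete-case approach that the abstract refers to. The paper's own conditions and construction may differ.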


