Prediction with Missing Data

04/07/2021
by Dimitris Bertsimas, et al.

Missing information is inevitable in real-world data sets. While imputation is well-suited and theoretically sound for statistical inference, its relevance and practical implementation for out-of-sample prediction remain unsettled. We provide a theoretical analysis of widely used data imputation methods and highlight their key deficiencies in making accurate predictions. As an alternative, we propose adaptive linear regression, a new class of models that can be directly trained and evaluated on partially observed data, adapting to the set of available features. In particular, we show that certain adaptive regression models are equivalent to impute-then-regress methods where the imputation and the regression models are learned simultaneously instead of sequentially. We validate our theoretical findings and adaptive regression approach with numerical results on real-world data sets.
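
The contrast between sequential impute-then-regress and a model that adapts to the set of available features can be illustrated with a minimal sketch. It assumes NumPy, mean imputation as the sequential baseline, and one simple illustrative form of adaptive model in which the intercept and each coefficient shift linearly with the missingness indicators. The function names are hypothetical, and this is not the authors' implementation.

```python
import numpy as np

def mean_impute_then_regress(X, y):
    """Sequential baseline: fill NaNs with column means, then fit least squares."""
    mu = np.nanmean(X, axis=0)  # imputation learned first, ignoring y

    def design(X_any):
        X_imp = np.where(np.isnan(X_any), mu, X_any)
        return np.column_stack([np.ones(len(X_imp)), X_imp])

    coef, *_ = np.linalg.lstsq(design(X), y, rcond=None)
    return lambda X_new: design(X_new) @ coef

def adaptive_features(X):
    """Features for an illustrative adaptive linear model (an assumption,
    not taken from the paper): observed values, missingness indicators,
    and their pairwise products."""
    M = np.isnan(X).astype(float)       # 1 where a feature is missing
    Z = np.where(np.isnan(X), 0.0, X)   # observed values, 0 where missing
    n, d = X.shape
    # Product z_j * m_k lets the coefficient of feature j change
    # when feature k is missing.
    inter = (Z[:, :, None] * M[:, None, :]).reshape(n, d * d)
    return np.column_stack([np.ones(n), Z, M, inter])

def fit_adaptive(X, y):
    """Fit all coefficients jointly, directly on partially observed data."""
    coef, *_ = np.linalg.lstsq(adaptive_features(X), y, rcond=None)
    return lambda X_new: adaptive_features(X_new) @ coef
```

Both functions return a predictor that accepts new inputs containing NaN entries. In this sketch the mean-imputed design matrix lies in the span of the adaptive features, so the adaptive fit cannot have a larger training error than the baseline; out-of-sample behavior depends on the data and on regularization, which the sketch omits.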

research
04/02/2019

UAFS: Uncertainty-Aware Feature Selection for Problems with Missing Data

Missing data are a concern in many real world data sets and imputation m...
research
01/22/2020

Multiple imputation in functional regression with applications to EEG data in a depression study

Methods for estimating parameters in functional regression models requir...
research
11/17/2015

Optimized Linear Imputation

Often in real-world datasets, especially in high dimensional data, some ...
research
11/09/2019

Missing Features Reconstruction and Its Impact on Classification Accuracy

In real-world applications, we can encounter situations when a well-trai...
research
06/29/2023

Understanding Pathologies of Deep Heteroskedastic Regression

Several recent studies have reported negative results when using heteros...
research
07/27/2019

Bayesian Robustness: A Nonasymptotic Viewpoint

We study the problem of robustly estimating the posterior distribution f...
research
06/03/2022

Estimation of Over-parameterized Models via Fitting to Future Observations

From a model-building perspective, in this paper we propose a paradigm s...
