Conformal Prediction with Missing Values

06/05/2023
by   Margaux Zaffran, et al.
0

Conformal prediction is a theoretically grounded framework for constructing predictive intervals. We study conformal prediction with missing values in the covariates – a setting that brings new challenges to uncertainty quantification. We first show that the marginal coverage guarantee of conformal prediction holds on imputed data for any missingness distribution and almost all imputation functions. However, we emphasize that the average coverage varies depending on the pattern of missing values: conformal methods tend to construct prediction intervals that under-cover the response conditionally to some missing patterns. This motivates our novel generalized conformalized quantile regression framework, missing data augmentation, which yields prediction intervals that are valid conditionally to the patterns of missing values, despite their exponential number. We then show that a universally consistent quantile regression algorithm trained on the imputed data is Bayes optimal for the pinball risk, thus achieving valid coverage conditionally to any given data point. Moreover, we examine the case of a linear model, which demonstrates the importance of our proposal in overcoming the heteroskedasticity induced by missing values. Using synthetic and data from critical care, we corroborate our theory and report improved performance of our methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/08/2019

Conformalized Quantile Regression

Conformal prediction is a technique for constructing prediction interval...
research
06/28/2023

UTOPIA: Universally Trainable Optimal Prediction Intervals Aggregation

Uncertainty quantification for prediction is an intriguing problem with ...
research
03/07/2022

On the Construction of Distribution-Free Prediction Intervals for an Image Regression Problem in Semiconductor Manufacturing

The high-volume manufacturing of the next generation of semiconductor de...
research
02/12/2020

Estimating Uncertainty Intervals from Collaborating Networks

Effective decision making requires understanding the uncertainty inheren...
research
06/22/2022

Sharing pattern submodels for prediction with missing values

Missing values are unavoidable in many applications of machine learning ...
research
02/13/2021

Variable importance scores

Scoring of variables for importance in predicting a response is an ill-d...
research
05/20/2022

Conformal Prediction with Temporal Quantile Adjustments

We develop Temporal Quantile Adjustment (TQA), a general method to const...

Please sign up or login with your details

Forgot password? Click here to reset