Conformal prediction for exponential families and generalized linear models

05/09/2019
by Daniel J. Eck, et al.

Conformal prediction methods construct prediction regions for iid data that are valid in finite samples. Distribution-free conformal prediction methods have been proposed for regression. Generalized linear models (GLMs) are a widely used class of regression models, and researchers often seek predictions from fitted GLMs. We provide a parametric conformal prediction region for GLMs that possesses finite-sample validity and is asymptotically of minimal length when the model is correctly specified. This parametric conformal prediction region is asymptotically minimal at the √(log(n)/n) rate when the dimension d of the predictor is one or two, and converges at the O{(log(n)/n)^(1/d)} rate when d > 2. We develop a novel concentration inequality for maximum likelihood estimation in exponential families that induces these convergence rates. We analyze the coverage, large-sample efficiency, and robustness properties of four methods for constructing conformal prediction intervals for GLMs: fully nonparametric kernel-based conformal, residual-based conformal, normalized residual-based conformal, and parametric conformal, which uses the assumed GLM density as a conformity measure. Extensive simulations compare these approaches to standard asymptotic prediction regions. The utility of the parametric conformal prediction region is demonstrated in an application to interval prediction of glycosylated hemoglobin levels, a blood measurement used to diagnose diabetes.
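To make the idea of using the assumed model density as a conformity measure concrete, the following is a minimal sketch of full conformal prediction for the simplest GLM, a Gaussian linear model with identity link. It is an illustration under stated assumptions, not the authors' implementation: the function names, the candidate grid, the simulated data, and the choice of the Gaussian family are all hypothetical. For each candidate response, the model is refit by maximum likelihood on the augmented sample, and the candidate is kept when its fitted density does not rank among the lowest.

```python
import numpy as np


def gaussian_density(y, mu, sigma2):
    """Normal density with mean mu and variance sigma2, evaluated at y."""
    return np.exp(-0.5 * (y - mu) ** 2 / sigma2) / np.sqrt(2.0 * np.pi * sigma2)


def parametric_conformal_region(X, y, x_new, y_grid, alpha=0.1):
    """Grid values kept in a full conformal region (hypothetical helper).

    Conformity score: the fitted Gaussian density at each (x_i, y_i),
    with the model refit on the sample augmented by (x_new, y_cand).
    """
    X_aug = np.vstack([X, x_new])
    kept = []
    for y_cand in y_grid:
        y_aug = np.append(y, y_cand)
        # MLE for the Gaussian linear model: least squares + mean squared residual
        beta, *_ = np.linalg.lstsq(X_aug, y_aug, rcond=None)
        mu = X_aug @ beta
        sigma2 = np.mean((y_aug - mu) ** 2)
        scores = gaussian_density(y_aug, mu, sigma2)
        # Conformal p-value: share of points scoring no higher than the candidate
        p_value = np.mean(scores <= scores[-1])
        if p_value > alpha:
            kept.append(y_cand)
    return np.array(kept)


# Simulated data (assumption: y = 1 + 2x + noise with standard deviation 0.5)
rng = np.random.default_rng(0)
n = 100
x = rng.uniform(0.0, 1.0, size=n)
X = np.column_stack([np.ones(n), x])
y = 1.0 + 2.0 * x + rng.normal(0.0, 0.5, size=n)

x_new = np.array([1.0, 0.5])           # intercept term and predictor value 0.5
y_grid = np.linspace(-3.0, 7.0, 401)   # candidate responses to test
region = parametric_conformal_region(X, y, x_new, y_grid, alpha=0.1)
print("approximate 90% region:", region.min(), "to", region.max())
```

With a common variance, the Gaussian density score orders points the same way as the absolute residual, so this special case does not show the difference between the parametric and residual-based methods; a GLM with non-constant variance (for example Gamma or Poisson) would be needed to see it, with the least-squares fit replaced by that model's maximum likelihood fit.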


