Conformalization of Sparse Generalized Linear Models

07/11/2023
by   Etash Kumar Guha, et al.
0

Given a sequence of observable variables {(x_1, y_1), …, (x_n, y_n)}, the conformal prediction method estimates a confidence set for y_n+1 given x_n+1 that is valid for any finite sample size by merely assuming that the joint distribution of the data is permutation invariant. Although attractive, computing such a set is computationally infeasible in most regression problems. Indeed, in these cases, the unknown variable y_n+1 can take an infinite number of possible candidate values, and generating conformal sets requires retraining a predictive model for each candidate. In this paper, we focus on a sparse linear model with only a subset of variables for prediction and use numerical continuation techniques to approximate the solution path efficiently. The critical property we exploit is that the set of selected variables is invariant under a small perturbation of the input data. Therefore, it is sufficient to enumerate and refit the model only at the change points of the set of active features and smoothly interpolate the rest of the solution via a Predictor-Corrector mechanism. We show how our path-following algorithm accurately approximates conformal prediction sets and illustrate its performance using synthetic and real data examples.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/19/2021

Stable Conformal Prediction Sets

When one observes a sequence of variables (x_1, y_1), ..., (x_n, y_n), c...
research
04/29/2011

Model Selection Consistency for Cointegrating Regressions

We study the asymptotic properties of the adaptive Lasso in cointegratio...
research
02/04/2022

Model Averaging for Generalized Linear Models in Fragmentary Data Prediction

Fragmentary data is becoming more and more popular in many areas which b...
research
12/13/2019

Understanding complex predictive models with Ghost Variables

We propose a procedure for assigning a relevance measure to each explana...
research
01/27/2017

Subset Selection for Multiple Linear Regression via Optimization

Subset selection in multiple linear regression is to choose a subset of ...
research
04/09/2023

Maximum Agreement Linear Prediction via the Concordance Correlation Coefficient

This paper examines distributional properties and predictive performance...
research
04/14/2021

Root-finding Approaches for Computing Conformal Prediction Set

Conformal prediction constructs a confidence set for an unobserved respo...

Please sign up or login with your details

Forgot password? Click here to reset