Distribution-free inference for regression: discrete, continuous, and in between

05/28/2021
by   Yonghoon Lee, et al.
0

In data analysis problems where we are not able to rely on distributional assumptions, what types of inference guarantees can still be obtained? Many popular methods, such as holdout methods, cross-validation methods, and conformal prediction, are able to provide distribution-free guarantees for predictive inference, but the problem of providing inference for the underlying regression function (for example, inference on the conditional mean 𝔼[Y|X]) is more challenging. In the setting where the features X are continuously distributed, recent work has established that any confidence interval for 𝔼[Y|X] must have non-vanishing width, even as sample size tends to infinity. At the other extreme, if X takes only a small number of possible values, then inference on 𝔼[Y|X] is trivial to achieve. In this work, we study the problem in settings in between these two extremes. We find that there are several distinct regimes in between the finite setting and the continuous setting, where vanishing-width confidence intervals are achievable if and only if the effective support size of the distribution of X is smaller than the square of the sample size.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/20/2020

Is distribution-free inference possible for binary regression?

For a regression problem with a binary label response, we examine the pr...
research
07/01/2018

Calculation of sample size guaranteeing the required width of the empirical confidence interval with predefined probability

The goal of any estimation study is an interval estimation of a the para...
research
02/16/2021

Distribution-Free Conditional Median Inference

We consider the problem of constructing confidence intervals for the med...
research
01/12/2023

confidence-planner: Easy-to-Use Prediction Confidence Estimation and Sample Size Planning

Machine learning applications, especially in the fields of me­di­cine an...
research
09/05/2018

Conditional predictive inference for high-dimensional stable algorithms

We investigate generically applicable and intuitively appealing predicti...
research
05/08/2019

Predictive inference with the jackknife+

This paper introduces the jackknife+, which is a novel method for constr...
research
10/18/2022

Heteroscedasticity-aware sample trimming for causal inference

A popular method for variance reduction in observational causal inferenc...

Please sign up or login with your details

Forgot password? Click here to reset