Valid Inference for Machine Learning Model Parameters

02/21/2023
by   Neil Dey, et al.
0

The parameters of a machine learning model are typically learned by minimizing a loss function on a set of training data. However, this can come with the risk of overtraining; in order for the model to generalize well, it is of great importance that we are able to find the optimal parameter for the model on the entire population – not only on the given training sample. In this paper, we construct valid confidence sets for this optimal parameter of a machine learning model, which can be generated using only the training data without any knowledge of the population. We then show that studying the distribution of this confidence set allows us to assign a notion of confidence to arbitrary regions of the parameter space, and we demonstrate that this distribution can be well-approximated using bootstrapping techniques.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/01/2023

Joint Coverage Regions: Simultaneous Confidence and Prediction Sets

We introduce Joint Coverage Regions (JCRs), which unify confidence inter...
research
10/18/2019

Identification of Model Uncertainty via Optimal Design of Experiments applied to a Mechanical Press

In engineering applications almost all processes are described with the ...
research
03/28/2023

Understanding and Exploring the Whole Set of Good Sparse Generalized Additive Models

In real applications, interaction between machine learning model and dom...
research
11/22/2019

Optimizing Data Usage via Differentiable Rewards

To acquire a new skill, humans learn better and faster if a tutor, based...
research
11/09/2020

Risk Assessment for Machine Learning Models

In this paper we propose a framework for assessing the risk associated w...
research
10/16/2022

Learning Probabilities of Causation from Finite Population Data

This paper deals with the problem of learning the probabilities of causa...
research
02/20/2020

Differentially Private ERM Based on Data Perturbation

In this paper, after observing that different training data instances af...

Please sign up or login with your details

Forgot password? Click here to reset