Learning Prediction Intervals for Model Performance

12/15/2020
by   Benjamin Elder, et al.
0

Understanding model performance on unlabeled data is a fundamental challenge of developing, deploying, and maintaining AI systems. Model performance is typically evaluated using test sets or periodic manual quality assessments, both of which require laborious manual data labeling. Automated performance prediction techniques aim to mitigate this burden, but potential inaccuracy and a lack of trust in their predictions has prevented their widespread adoption. We address this core problem of performance prediction uncertainty with a method to compute prediction intervals for model performance. Our methodology uses transfer learning to train an uncertainty model to estimate the uncertainty of model performance predictions. We evaluate our approach across a wide range of drift conditions and show substantial improvement over competitive baselines. We believe this result makes prediction intervals, and performance prediction in general, significantly more practical for real-world use.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/12/2023

Confident Object Detection via Conformal Prediction and Conformal Risk Control: an Application to Railway Signaling

Deploying deep learning models in real-world certified systems requires ...
research
02/08/2022

The Lifecycle of a Statistical Model: Model Failure Detection, Identification, and Refitting

The statistical machine learning community has demonstrated considerable...
research
06/01/2021

Locally Valid and Discriminative Confidence Intervals for Deep Learning Models

Crucial for building trust in deep learning models for critical real-wor...
research
05/12/2021

An Empirical Experiment on Deep Learning Models for Predicting Traffic Data

To tackle ever-increasing city traffic congestion problems, researchers ...
research
04/25/2021

Model-based metrics: Sample-efficient estimates of predictive model subpopulation performance

Machine learning models - now commonly developed to screen, diagnose, or...
research
12/19/2019

Per-sample Prediction Intervals for Extreme Learning Machines

Prediction intervals in supervised Machine Learning bound the region whe...
research
06/01/2021

Uncertainty Characteristics Curves: A Systematic Assessment of Prediction Intervals

Accurate quantification of model uncertainty has long been recognized as...

Please sign up or login with your details

Forgot password? Click here to reset