Auditing Pointwise Reliability Subsequent to Training

01/02/2019
by   Peter Schulam, et al.
0

To use machine learning in high stakes applications (e.g. medicine), we need tools for building confidence in the system and evaluating whether it is reliable. Methods to improve model reliability are often applied at train time (e.g. using Bayesian inference to obtain uncertainty estimates). An alternative is to audit a fixed model subsequent to training. In this paper, we describe resampling uncertainty estimation (RUE), an algorithm to audit the pointwise reliability of predictions. Intuitively, RUE estimates the amount that a single prediction would change if the model had been fit on different training data drawn from the same distribution by using the gradient and Hessian of the model's loss on training data. Experimentally, we show that RUE more effectively detects inaccurate predictions than existing tools for auditing reliability subsequent to training. We also show that RUE can create predictive distributions that are competitive with state-of-the-art methods like Monte Carlo dropout, probabilistic backpropagation, and deep ensembles, but does not depend on specific algorithms at train-time like these methods do.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/02/2019

Can You Trust This Prediction? Auditing Pointwise Reliability After Learning

To use machine learning in high stakes applications (e.g. medicine), we ...
research
11/21/2022

ZigZag: Universal Sampling-free Uncertainty Estimation Through Two-Step Inference

Whereas the ability of deep networks to produce useful predictions on ma...
research
06/26/2020

Unlabelled Data Improves Bayesian Uncertainty Calibration under Covariate Shift

Modern neural networks have proven to be powerful function approximators...
research
09/16/2019

Prediction Uncertainty Estimation for Hate Speech Classification

As a result of social network popularity, in recent years, hate speech p...
research
12/15/2020

Masksembles for Uncertainty Estimation

Deep neural networks have amply demonstrated their prowess but estimatin...
research
08/20/2019

n-MeRCI: A new Metric to Evaluate the Correlation Between Predictive Uncertainty and True Error

As deep learning applications are becoming more and more pervasive in ro...
research
10/24/2022

Reliability-Aware Prediction via Uncertainty Learning for Person Image Retrieval

Current person image retrieval methods have achieved great improvements ...

Please sign up or login with your details

Forgot password? Click here to reset