Can You Trust This Prediction? Auditing Pointwise Reliability After Learning

01/02/2019
by   Peter Schulam, et al.
1

To use machine learning in high stakes applications (e.g. medicine), we need tools for building confidence in the system and evaluating whether it is reliable. Methods to improve model reliability often require new learning algorithms (e.g. using Bayesian inference to obtain uncertainty estimates). An alternative is to audit a model after it is trained. In this paper, we describe resampling uncertainty estimation (RUE), an algorithm to audit the pointwise reliability of predictions. Intuitively, RUE estimates the amount that a prediction would change if the model had been fit on different training data. The algorithm uses the gradient and Hessian of the model's loss function to create an ensemble of predictions. Experimentally, we show that RUE more effectively detects inaccurate predictions than existing tools for auditing reliability subsequent to training. We also show that RUE can create predictive distributions that are competitive with state-of-the-art methods like Monte Carlo dropout, probabilistic backpropagation, and deep ensembles, but does not depend on specific algorithms at train-time like these methods do.

READ FULL TEXT
research
01/02/2019

Auditing Pointwise Reliability Subsequent to Training

To use machine learning in high stakes applications (e.g. medicine), we ...
research
11/21/2022

ZigZag: Universal Sampling-free Uncertainty Estimation Through Two-Step Inference

Whereas the ability of deep networks to produce useful predictions on ma...
research
09/16/2019

Prediction Uncertainty Estimation for Hate Speech Classification

As a result of social network popularity, in recent years, hate speech p...
research
08/20/2019

n-MeRCI: A new Metric to Evaluate the Correlation Between Predictive Uncertainty and True Error

As deep learning applications are becoming more and more pervasive in ro...
research
12/15/2020

Masksembles for Uncertainty Estimation

Deep neural networks have amply demonstrated their prowess but estimatin...
research
04/17/2023

On Uncertainty Calibration and Selective Generation in Probabilistic Neural Summarization: A Benchmark Study

Modern deep models for summarization attains impressive benchmark perfor...
research
03/03/2021

Parsimonious Inference

Bayesian inference provides a uniquely rigorous approach to obtain princ...

Please sign up or login with your details

Forgot password? Click here to reset