Verified Uncertainty Calibration

09/23/2019
by   Ananya Kumar, et al.
2

Applications such as weather forecasting and personalized medicine demand models that output calibrated probability estimates - those representative of the true likelihood of a prediction. Most models are not calibrated out of the box but are recalibrated by post-processing model outputs. We find in this work that popular recalibration methods like Platt scaling and temperature scaling, are (i) less calibrated than reported and (ii) current techniques cannot estimate how miscalibrated they are. An alternative method, histogram binning, has measurable calibration error but is sample inefficient - it requires O(B/ϵ^2) samples, compared to O(1/ϵ^2) for scaling methods, where B is the number of distinct probabilities the model can output. To get the best of both worlds, we introduce the scaling-binning calibrator, which first fits a parametric function that acts like a baseline for variance reduction and then bins the function values to actually ensure calibration. This requires only O(1/ϵ^2 + B) samples. We then show that methods used to estimate calibration error are suboptimal - we prove that an alternative estimator introduced in the meteorological community requires fewer samples - samples proportional to √(B) instead of B. We validate our approach with multiclass calibration experiments on CIFAR-10 and ImageNet, where we obtain a 35 unlike scaling methods, guarantees on true calibration.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/09/2022

MBCT: Tree-Based Feature-Aware Binning for Individual Uncertainty Calibration

Most machine learning classifiers only concern classification accuracy, ...
research
12/09/2021

Obtaining Calibrated Probabilities with Personalized Ranking Models

For personalized ranking models, the well-calibrated probability of an i...
research
05/10/2021

Distribution-free calibration guarantees for histogram binning without sample splitting

We prove calibration guarantees for the popular histogram binning (also ...
research
09/23/2022

Neural Clamping: Joint Input Perturbation and Temperature Scaling for Neural Network Calibration

Neural network calibration is an essential task in deep learning to ensu...
research
02/25/2021

Confidence Calibration with Bounded Error Using Transformations

As machine learning techniques become widely adopted in new domains, esp...
research
04/28/2023

Online Platt Scaling with Calibeating

We present an online post-hoc calibration method, called Online Platt Sc...
research
04/30/2018

Explaining Constraint Interaction: How to Interpret Estimated Model Parameters under Alternative Scaling Methods

In this paper, we explain the reasons behind constraint interaction, whi...

Please sign up or login with your details

Forgot password? Click here to reset