On the Usefulness of the Fit-on-the-Test View on Evaluating Calibration of Classifiers

03/16/2022
by   Markus Kängsepp, et al.
0

Every uncalibrated classifier has a corresponding true calibration map that calibrates its confidence. Deviations of this idealistic map from the identity map reveal miscalibration. Such calibration errors can be reduced with many post-hoc calibration methods which fit some family of calibration maps on a validation dataset. In contrast, evaluation of calibration with the expected calibration error (ECE) on the test set does not explicitly involve fitting. However, as we demonstrate, ECE can still be viewed as if fitting a family of functions on the test data. This motivates the fit-on-the-test view on evaluation: first, approximate a calibration map on the test data, and second, quantify its distance from the identity. Exploiting this view allows us to unlock missed opportunities: (1) use the plethora of post-hoc calibration methods for evaluating calibration; (2) tune the number of bins in ECE with cross-validation. Furthermore, we introduce: (3) benchmarking on pseudo-real data where the true calibration map can be estimated very precisely; and (4) novel calibration and evaluation methods using new calibration map families PL and PL3.

READ FULL TEXT

page 9

page 32

page 33

research
12/20/2020

Post-hoc Uncertainty Calibration for Domain Drift Scenarios

We address the problem of uncertainty calibration. While standard deep n...
research
05/10/2021

Meta-Cal: Well-controlled Post-hoc Calibration by Ranking

In many applications, it is desirable that a classifier not only makes a...
research
07/13/2022

Estimating Classification Confidence Using Kernel Densities

This paper investigates the post-hoc calibration of confidence for "expl...
research
03/15/2020

Intra Order-preserving Functions for Calibration of Multi-Class Neural Networks

Predicting calibrated confidence scores for multi-class deep networks is...
research
02/10/2022

Heterogeneous Calibration: A post-hoc model-agnostic framework for improved generalization

We introduce the notion of heterogeneous calibration that applies a post...
research
06/08/2023

Stratification of uncertainties recalibrated by isotonic regression and its impact on calibration error statistics

Abstract Post hoc recalibration of prediction uncertainties of machine l...
research
02/20/2021

On Calibration and Out-of-domain Generalization

Out-of-domain (OOD) generalization is a significant challenge for machin...

Please sign up or login with your details

Forgot password? Click here to reset