Calibration Error Estimation Using Fuzzy Binning

04/30/2023
by   Geetanjali Bihani, et al.
0

Neural network-based decisions tend to be overconfident, where their raw outcome probabilities do not align with the true decision probabilities. Calibration of neural networks is an essential step towards more reliable deep learning frameworks. Prior metrics of calibration error primarily utilize crisp bin membership-based measures. This exacerbates skew in model probabilities and portrays an incomplete picture of calibration error. In this work, we propose a Fuzzy Calibration Error metric (FCE) that utilizes a fuzzy binning approach to calculate calibration error. This approach alleviates the impact of probability skew and provides a tighter estimate while measuring calibration error. We compare our metric with ECE across different data populations and class memberships. Our results show that FCE offers better calibration error estimation, especially in multi-class settings, alleviating the effects of skew in model confidence scores on calibration error estimation. We make our code and supplementary materials available at: \href{https://github.com/bihani-g/fce}{https://github.com/bihani-g/fce}

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/02/2019

Measuring Calibration in Deep Learning

The reliability of a machine learning model's confidence in its predicti...
research
09/20/2018

Spline-Based Probability Calibration

In many classification problems it is desirable to output well-calibrate...
research
09/12/2023

Rethinking Evaluation Metric for Probability Estimation Models Using Esports Data

Probability estimation models play an important role in various fields, ...
research
06/25/2023

TCE: A Test-Based Approach to Measuring Calibration Error

This paper proposes a new metric to measure the calibration error of pro...
research
10/07/2022

Class-wise and reduced calibration methods

For many applications of probabilistic classifiers it is important that ...
research
10/28/2022

Stop Measuring Calibration When Humans Disagree

Calibration is a popular framework to evaluate whether a classifier know...
research
05/25/2019

EPCI: A New Tool for Predicting Absolute Permeability from CT images

A new and fast Matlab algorithm for predicting absolute permeability is ...

Please sign up or login with your details

Forgot password? Click here to reset