Towards Clear Expectations for Uncertainty Estimation

07/27/2022
by   Victor Bouvier, et al.
21

If Uncertainty Quantification (UQ) is crucial to achieve trustworthy Machine Learning (ML), most UQ methods suffer from disparate and inconsistent evaluation protocols. We claim this inconsistency results from the unclear requirements the community expects from UQ. This opinion paper offers a new perspective by specifying those requirements through five downstream tasks where we expect uncertainty scores to have substantial predictive power. We design these downstream tasks carefully to reflect real-life usage of ML models. On an example benchmark of 7 classification datasets, we did not observe statistical superiority of state-of-the-art intrinsic UQ methods against simple baselines. We believe that our findings question the very rationale of why we quantify uncertainty and call for a standardized protocol for UQ evaluation based on metrics proven to be relevant for the ML practitioner.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/23/2022

Benchmarking Bayesian Deep Learning on Diabetic Retinopathy Detection Tasks

Bayesian deep learning seeks to equip deep neural networks with the abil...
research
05/16/2023

Synthetic data, real errors: how (not) to publish and use synthetic data

Generating synthetic data through generative models is gaining interest ...
research
01/13/2023

A Rigorous Uncertainty-Aware Quantification Framework Is Essential for Reproducible and Replicable Machine Learning Workflows

The ability to replicate predictions by machine learning (ML) or artific...
research
11/06/2019

Unfairness towards subjective opinions in Machine Learning

Despite the high interest for Machine Learning (ML) in academia and indu...
research
05/07/2023

Uncertainty Quantification in Machine Learning for Engineering Design and Health Prognostics: A Tutorial

On top of machine learning models, uncertainty quantification (UQ) funct...
research
05/18/2022

SoK: The Impact of Unlabelled Data in Cyberthreat Detection

Machine learning (ML) has become an important paradigm for cyberthreat d...
research
11/28/2022

A Call to Reflect on Evaluation Practices for Failure Detection in Image Classification

Reliable application of machine learning-based decision systems in the w...

Please sign up or login with your details

Forgot password? Click here to reset