Is the Performance of My Deep Network Too Good to Be True? A Direct Approach to Estimating the Bayes Error in Binary Classification

02/01/2022
by   Takashi Ishida, et al.
0

There is a fundamental limitation in the prediction performance that a machine learning model can achieve due to the inevitable uncertainty of the prediction target. In classification problems, this can be characterized by the Bayes error, which is the best achievable error with any classifier. The Bayes error can be used as a criterion to evaluate classifiers with state-of-the-art performance and can be used to detect test set overfitting. We propose a simple and direct Bayes error estimator, where we just take the mean of the labels that show uncertainty of the classes. Our flexible approach enables us to perform Bayes error estimation even for weakly supervised data. In contrast to others, our method is model-free and even instance-free. Moreover, it has no hyperparameters and gives a more accurate estimate of the Bayes error than classifier-based baselines. Experiments using our method suggest that a recently proposed classifier, the Vision Transformer, may have already reached the Bayes error for certain benchmark datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/16/2019

Learning to Benchmark: Determining Best Achievable Misclassification Error from Training Data

We address the problem of learning to benchmark the best achievable clas...
research
06/07/2021

Evaluating State-of-the-Art Classification Models Against Bayes Optimality

Evaluating the inherent difficulty of a given data-driven classification...
research
03/24/2022

The Dutch Draw: Constructing a Universal Baseline for Binary Prediction Models

Novel prediction methods should always be compared to a baseline to know...
research
11/09/2017

Fast Meta-Learning for Adaptive Hierarchical Classifier Design

We propose a new splitting criterion for a meta-learning approach to mul...
research
10/31/2017

Rate-optimal Meta Learning of Classification Error

Meta learning of optimal classifier error rates allows an experimenter t...
research
04/27/2015

Meta learning of bounds on the Bayes classifier error

Meta learning uses information from base learners (e.g. classifiers or e...
research
05/14/2020

Training conformal predictors

Efficiency criteria for conformal prediction, such as observed fuzziness...

Please sign up or login with your details

Forgot password? Click here to reset