Does Your Model Know the Digit 6 Is Not a Cat? A Less Biased Evaluation of "Outlier" Detectors

09/13/2018
by   Alireza Shafaei, et al.
4

In the real world, a learning system could receive an input that looks nothing like anything it has seen during training, and this can lead to unpredictable behaviour. We thus need to know whether any given input belongs to the population distribution of the training data to prevent unpredictable behaviour in deployed systems. A recent surge of interest on this problem has led to the development of sophisticated techniques in the deep learning literature. However, due to the absence of a standardized problem formulation or an exhaustive evaluation, it is not evident if we can rely on these methods in practice. What makes this problem different from a typical supervised learning setting is that we cannot model the diversity of out-of-distribution samples in practice. The distribution of outliers used in training may not be the same as the distribution of outliers encountered in the application. Therefore, classical approaches that learn inliers vs. outliers with only two datasets can yield optimistic results. We introduce OD-test, a three-dataset evaluation scheme as a practical and more reliable strategy to assess progress on this problem. The OD-test benchmark provides a straightforward means of comparison for methods that address the out-of-distribution sample detection problem. We present an exhaustive evaluation of a broad set of methods from related areas on image classification tasks. Furthermore, we show that for realistic applications of high-dimensional images, the existing methods have low accuracy. Our analysis reveals areas of strength and weakness of each method.

READ FULL TEXT

page 2

page 7

page 13

page 21

page 22

research
07/18/2023

Towards Trustworthy Dataset Distillation

Efficiency and trustworthiness are two eternal pursuits when applying de...
research
08/20/2022

Evaluating Out-of-Distribution Detectors Through Adversarial Generation of Outliers

A reliable evaluation method is essential for building a robust out-of-d...
research
06/03/2023

DOS: Diverse Outlier Sampling for Out-of-Distribution Detection

Modern neural networks are known to give overconfident prediction for ou...
research
02/02/2022

VOS: Learning What You Don't Know by Virtual Outlier Synthesis

Out-of-distribution (OOD) detection has received much attention lately d...
research
04/25/2018

Solving Minimum Enclosing Ball with Outliers: Algorithm, Implementation, and Application

Motivated by the arising realistic issues in big data, the problem of Mi...
research
04/08/2021

Does Your Dermatology Classifier Know What It Doesn't Know? Detecting the Long-Tail of Unseen Conditions

We develop and rigorously evaluate a deep learning based system that can...
research
10/30/2018

Informed Democracy: Voting-based Novelty Detection for Action Recognition

Novelty detection is crucial for real-life applications. While it is com...

Please sign up or login with your details

Forgot password? Click here to reset