Is Out-of-Distribution Detection Learnable?

10/26/2022
by   Zhen Fang, et al.
0

Supervised learning aims to train a classifier under the assumption that training and test data are from the same distribution. To ease the above assumption, researchers have studied a more realistic setting: out-of-distribution (OOD) detection, where test data may come from classes that are unknown during training (i.e., OOD data). Due to the unavailability and diversity of OOD data, good generalization ability is crucial for effective OOD detection algorithms. To study the generalization of OOD detection, in this paper, we investigate the probably approximately correct (PAC) learning theory of OOD detection, which is proposed by researchers as an open problem. First, we find a necessary condition for the learnability of OOD detection. Then, using this condition, we prove several impossibility theorems for the learnability of OOD detection under some scenarios. Although the impossibility theorems are frustrating, we find that some conditions of these impossibility theorems may not hold in some practical scenarios. Based on this observation, we next give several necessary and sufficient conditions to characterize the learnability of OOD detection in some practical scenarios. Lastly, we also offer theoretical supports for several representative OOD detection works based on our OOD theory.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/30/2021

Learning Bounds for Open-Set Learning

Traditional supervised learning aims to train a classifier in the closed...
research
05/24/2023

Rethinking the Evaluation Protocol of Domain Generalization

Domain generalization aims to solve the challenge of Out-of-Distribution...
research
02/13/2020

Learn to Expect the Unexpected: Probably Approximately Correct Domain Generalization

Domain generalization is the problem of machine learning when the traini...
research
04/24/2020

A survey on domain adaptation theory

All famous machine learning algorithms that correspond to both supervise...
research
12/10/2020

Learn what you can't learn: Regularized Ensembles for Transductive Out-of-distribution Detection

Machine learning models are often used in practice if they achieve good ...
research
12/05/2012

On the Convergence Properties of Optimal AdaBoost

AdaBoost is one of the most popular machine-learning algorithms. It is s...
research
07/21/2020

What is important about the No Free Lunch theorems?

The No Free Lunch theorems prove that under a uniform distribution over ...

Please sign up or login with your details

Forgot password? Click here to reset