Towards Out-Of-Distribution Generalization: A Survey

08/31/2021
by   Zheyan Shen, et al.
7

Classic machine learning methods are built on the i.i.d. assumption that training and testing data are independent and identically distributed. However, in real scenarios, the i.i.d. assumption can hardly be satisfied, rendering the sharp drop of classic machine learning algorithms' performances under distributional shifts, which indicates the significance of investigating the Out-of-Distribution generalization problem. Out-of-Distribution (OOD) generalization problem addresses the challenging setting where the testing distribution is unknown and different from the training. This paper serves as the first effort to systematically and comprehensively discuss the OOD generalization problem, from the definition, methodology, evaluation to the implications and future directions. Firstly, we provide the formal definition of the OOD generalization problem. Secondly, existing methods are categorized into three parts based on their positions in the whole learning pipeline, namely unsupervised representation learning, supervised model learning and optimization, and typical methods for each category are discussed in detail. We then demonstrate the theoretical connections of different categories, and introduce the commonly used datasets and evaluation metrics. Finally, we summarize the whole literature and raise some future directions for OOD generalization problem. The summary of OOD generalization methods reviewed in this survey can be found at http://out-of-distribution-generalization.com.

READ FULL TEXT

page 21

page 22

research
02/16/2022

Out-Of-Distribution Generalization on Graphs: A Survey

Graph machine learning has been extensively studied in both academia and...
research
06/02/2023

Federated Domain Generalization: A Survey

Machine learning typically relies on the assumption that training and te...
research
03/02/2021

Generalizing to Unseen Domains: A Survey on Domain Generalization

Domain generalization (DG), i.e., out-of-distribution generalization, ha...
research
06/15/2022

A Comprehensive Survey on Deep Clustering: Taxonomy, Challenges, and Future Directions

Clustering is a fundamental machine learning task which has been widely ...
research
11/09/2020

A Survey of Label-noise Representation Learning: Past, Present and Future

Classical machine learning implicitly assumes that labels of the trainin...
research
11/22/2022

A Short Survey of Systematic Generalization

This survey includes systematic generalization and a history of how mach...
research
09/29/2021

Towards a theory of out-of-distribution learning

What is learning? 20^st century formalizations of learning theory – whic...

Please sign up or login with your details

Forgot password? Click here to reset