Out of Distribution Generalization in Machine Learning

03/03/2021
by   Martin Arjovsky, et al.
0

Machine learning has achieved tremendous success in a variety of domains in recent years. However, a lot of these success stories have been in places where the training and the testing distributions are extremely similar to each other. In everyday situations when models are tested in slightly different data than they were trained on, ML algorithms can fail spectacularly. This research attempts to formally define this problem, what sets of assumptions are reasonable to make in our data and what kind of guarantees we hope to obtain from them. Then, we focus on a certain class of out of distribution problems, their assumptions, and introduce simple algorithms that follow from these assumptions that are able to provide more reliable generalization. A central topic in the thesis is the strong link between discovering the causal structure of the data, finding features that are reliable (when using them to predict) regardless of their context, and out of distribution generalization.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/23/2023

Generalization with quantum geometry for learning unitaries

Generalization is the ability of quantum machine learning models to make...
research
04/21/2022

Out-of-distribution generalization for learning quantum dynamics

Generalization bounds are a critical tool to assess the training data re...
research
10/20/2020

Where Is the Normative Proof? Assumptions and Contradictions in ML Fairness Research

Across machine learning (ML) sub-disciplines researchers make mathematic...
research
07/13/2023

A Causal Framework to Unify Common Domain Generalization Approaches

Domain generalization (DG) is about learning models that generalize well...
research
06/08/2021

Towards a Theoretical Framework of Out-of-Distribution Generalization

Generalization to out-of-distribution (OOD) data, or domain generalizati...
research
09/29/2021

Towards a theory of out-of-distribution learning

What is learning? 20^st century formalizations of learning theory – whic...
research
10/18/2022

Generalizing in the Real World with Representation Learning

Machine learning (ML) formalizes the problem of getting computers to lea...

Please sign up or login with your details

Forgot password? Click here to reset