Out-of-Distribution Generalization Analysis via Influence Function

01/21/2021
by   Haotian Ye, et al.
6

The mismatch between training and target data is one major challenge for current machine learning systems. When training data is collected from multiple domains and the target domains include all training domains and other new domains, we are facing an Out-of-Distribution (OOD) generalization problem that aims to find a model with the best OOD accuracy. One of the definitions of OOD accuracy is worst-domain accuracy. In general, the set of target domains is unknown, and the worst over target domains may be unseen when the number of observed domains is limited. In this paper, we show that the worst accuracy over the observed domains may dramatically fail to identify the OOD accuracy. To this end, we introduce Influence Function, a classical tool from robust statistics, into the OOD generalization problem and suggest the variance of influence function to monitor the stability of a model on training domains. We show that the accuracy on test domains and the proposed index together can help us discern whether OOD algorithms are needed and whether a model achieves good OOD generalization.

READ FULL TEXT

page 3

page 18

research
04/27/2023

Moderately Distributional Exploration for Domain Generalization

Domain generalization (DG) aims to tackle the distribution shift between...
research
11/03/2019

Adversarial target-invariant representation learning for domain generalization

In many applications of machine learning, the training and test set data...
research
02/13/2020

Learn to Expect the Unexpected: Probably Approximately Correct Domain Generalization

Domain generalization is the problem of machine learning when the traini...
research
05/25/2023

On Influence Functions, Classification Influence, Relative Influence, Memorization and Generalization

Machine learning systems such as large scale recommendation systems or n...
research
11/12/2020

Domain Generalization in Biosignal Classification

Objective: When training machine learning models, we often assume that t...
research
06/08/2021

Towards a Theoretical Framework of Out-of-Distribution Generalization

Generalization to out-of-distribution (OOD) data, or domain generalizati...
research
03/02/2020

Out-of-Distribution Generalization via Risk Extrapolation (REx)

Generalizing outside of the training distribution is an open challenge f...

Please sign up or login with your details

Forgot password? Click here to reset