Distributionally Invariant Learning: Rationalization and Practical Algorithms

06/07/2022
by Jiashuo Liu, et al.

The invariance property across environments is at the heart of invariant learning methods for the Out-of-Distribution (OOD) Generalization problem. Although intuitively reasonable, learning the strict invariance property requires strong assumptions on the availability and quality of environments. To relax these requirements empirically, some recent works propose learning pseudo-environments for invariant learning. However, pursuing strict invariance under latent heterogeneity can be misleading, since the underlying invariance may already have been violated during the pseudo-environment learning procedure. To this end, we propose the distributional invariance property as a relaxed alternative to strict invariance: it considers invariance only among sub-populations down to a prescribed scale and allows a certain degree of variation. We reformulate the invariant learning problem under latent heterogeneity into a relaxed form that pursues distributional invariance, and on this basis we propose our novel Distributionally Invariant Learning (DIL) framework together with two implementations, DIL-MMD and DIL-KL. Theoretically, we provide guarantees for the distributional invariance as well as bounds on the generalization error gap. Extensive experimental results validate the effectiveness of our proposed algorithms.
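
The idea of tolerating "a certain degree of variation" among sub-populations can be illustrated with a small sketch. The abstract does not specify how DIL-MMD measures variation, so the following is only an assumption-laden illustration, not the authors' algorithm: it computes the standard biased estimate of the squared Maximum Mean Discrepancy (MMD) with an RBF kernel between samples from two sub-populations and compares it with a tolerance. The names `rbf_kernel`, `mean_kernel`, `mmd2`, `sample_gaussian`, and `tolerance`, the Gaussian toy data, and the bandwidth are all hypothetical choices made for this example.

```python
import math
import random


def rbf_kernel(u, v, bandwidth=1.0):
    """RBF (Gaussian) kernel between two points given as tuples."""
    sq_dist = sum((ui - vi) ** 2 for ui, vi in zip(u, v))
    return math.exp(-sq_dist / (2.0 * bandwidth ** 2))


def mean_kernel(xs, ys, bandwidth=1.0):
    """Average kernel value over all pairs (x, y) with x in xs, y in ys."""
    total = sum(rbf_kernel(x, y, bandwidth) for x in xs for y in ys)
    return total / (len(xs) * len(ys))


def mmd2(xs, ys, bandwidth=1.0):
    """Biased (V-statistic) estimate of the squared MMD between two samples."""
    return (
        mean_kernel(xs, xs, bandwidth)
        + mean_kernel(ys, ys, bandwidth)
        - 2.0 * mean_kernel(xs, ys, bandwidth)
    )


def sample_gaussian(rng, n, mean, dim=2):
    """Toy sub-population: n points from an isotropic unit-variance Gaussian."""
    return [tuple(rng.gauss(mean, 1.0) for _ in range(dim)) for _ in range(n)]


rng = random.Random(0)
group_a = sample_gaussian(rng, 100, 0.0)  # reference sub-population
group_b = sample_gaussian(rng, 100, 0.0)  # same distribution as group_a
group_c = sample_gaussian(rng, 100, 2.0)  # mean-shifted sub-population

# Hypothetical tolerance: a relaxed invariance check accepts small
# discrepancies instead of demanding an exact match between groups.
tolerance = 0.1
gap_similar = mmd2(group_a, group_b)
gap_shifted = mmd2(group_a, group_c)

print("similar groups within tolerance:", gap_similar <= tolerance)
print("shifted groups within tolerance:", gap_shifted <= tolerance)
```

In this toy setting the discrepancy between the two samples drawn from the same distribution is smaller than the discrepancy against the mean-shifted sample, which is the kind of graded comparison a relaxed invariance criterion relies on. How the paper defines sub-populations, the prescribed scale, and the allowed variation is given in the full text, not here.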


research
05/09/2021

Heterogeneous Risk Minimization

Machine learning algorithms with empirical risk minimization usually suf...
research
02/19/2020

Identifying Invariant Factors Across Multiple Environments with KL Regression

Many datasets are collected from multiple environments (e.g. different l...
research
10/24/2021

Kernelized Heterogeneous Risk Minimization

The ability to generalize under distributional shifts is essential to re...
research
06/12/2020

Traversal-invariant characterizations of logarithmic space

We give a novel descriptive-complexity theoretic characterization of L a...
research
05/30/2022

PAC Generalization via Invariant Representations

One method for obtaining generalizable solutions to machine learning tas...
research
03/11/2022

ZIN: When and How to Learn Invariance by Environment Inference?

It is commonplace to encounter heterogeneous data, of which some aspects...
research
03/04/2018

Accelerating Natural Gradient with Higher-Order Invariance

An appealing property of the natural gradient is that it is invariant to...
