Context is Environment

09/18/2023
by   Sharut Gupta, et al.
0

Two lines of work are taking the central stage in AI research. On the one hand, the community is making increasing efforts to build models that discard spurious correlations and generalize better in novel test environments. Unfortunately, the bitter lesson so far is that no proposal convincingly outperforms a simple empirical risk minimization baseline. On the other hand, large language models (LLMs) have erupted as algorithms able to learn in-context, generalizing on-the-fly to eclectic contextual circumstances that users enforce by means of prompting. In this paper, we argue that context is environment, and posit that in-context learning holds the key to better domain generalization. Via extensive theory and experiments, we show that paying attention to contextx2013x2013unlabeled examples as they arrivex2013x2013allows our proposed In-Context Risk Minimization (ICRM) algorithm to zoom-in on the test environment risk minimizer, leading to significant out-of-distribution performance improvements. From all of this, two messages are worth taking home. Researchers in domain generalization should consider environment as context, and harness the adaptive power of in-context learning. Researchers in LLMs should consider context as environment, to better structure data towards generalization.

READ FULL TEXT
research
08/30/2023

Domain Generalization without Excess Empirical Risk

Given data from diverse sets of distinct distributions, domain generaliz...
research
10/16/2021

Invariant Language Modeling

Modern pretrained language models are critical components of NLP pipelin...
research
07/20/2023

Sharpness Minimization Algorithms Do Not Only Minimize Sharpness To Achieve Better Generalization

Despite extensive studies, the underlying reason as to why overparameter...
research
06/05/2022

Impossibility of Collective Intelligence

Democratization of AI involves training and deploying machine learning m...
research
06/18/2021

Iterative Feature Matching: Toward Provable Domain Generalization with Logarithmic Environments

Domain generalization aims at performing well on unseen test environment...
research
09/18/2019

Towards Shape Biased Unsupervised Representation Learning for Domain Generalization

It is known that, without awareness of the process, our brain appears to...
research
03/17/2023

Finding Competence Regions in Domain Generalization

We propose a "learning to reject" framework to address the problem of si...

Please sign up or login with your details

Forgot password? Click here to reset