Spuriosity Didn't Kill the Classifier: Using Invariant Predictions to Harness Spurious Features

07/19/2023
by   Cian Eastwood, et al.
0

To avoid failures on out-of-distribution data, recent works have sought to extract features that have a stable or invariant relationship with the label across domains, discarding the "spurious" or unstable features whose relationship with the label changes across domains. However, unstable features often carry complementary information about the label that could boost performance if used correctly in the test domain. Our main contribution is to show that it is possible to learn how to use these unstable features in the test domain without labels. In particular, we prove that pseudo-labels based on stable features provide sufficient guidance for doing so, provided that stable and unstable features are conditionally independent given the label. Based on this theoretical insight, we propose Stable Feature Boosting (SFB), an algorithm for: (i) learning a predictor that separates stable and conditionally-independent unstable features; and (ii) using the stable-feature predictions to adapt the unstable-feature predictions in the test domain. Theoretically, we prove that SFB can learn an asymptotically-optimal predictor without test-domain labels. Empirically, we demonstrate the effectiveness of SFB on real and synthetic data.

READ FULL TEXT
research
07/23/2018

Domain Generalization via Conditional Invariant Representation

Domain generalization aims to apply knowledge gained from multiple label...
research
10/04/2021

Learning Domain-Invariant Relationship with Instrumental Variable for Domain Generalization

Domain generalization (DG) aims to learn from multiple source domains a ...
research
07/21/2020

Domain Generalization with Optimal Transport and Metric Learning

Generalizing knowledge to unseen domains, where data and labels are unav...
research
01/23/2019

kd-switch: A Universal Online Predictor with an application to Sequential Two-Sample Testing

We propose a novel online predictor for discrete labels conditioned on m...
research
11/04/2021

Testing using Privileged Information by Adapting Features with Statistical Dependence

Given an imperfect predictor, we exploit additional features at test tim...
research
01/15/2021

Harmonization and the Worst Scanner Syndrome

We show that for a wide class of harmonization/domain-invariance schemes...
research
02/14/2022

Domain-Adjusted Regression or: ERM May Already Learn Features Sufficient for Out-of-Distribution Generalization

A common explanation for the failure of deep networks to generalize out-...

Please sign up or login with your details

Forgot password? Click here to reset