Learning Representations that Support Robust Transfer of Predictors

10/19/2021
by   Yilun Xu, et al.

Ensuring generalization to unseen environments remains a challenge. Domain shift can lead to substantially degraded performance unless shifts are well-exercised within the available training environments. We introduce a simple robust estimation criterion – transfer risk – that is specifically geared towards optimizing transfer to new environments. Effectively, the criterion amounts to finding a representation that minimizes the risk of applying any optimal predictor trained on one environment to another. The transfer risk essentially decomposes into two terms, a direct transfer term and a weighted gradient-matching term arising from the optimality of per-environment predictors. Although inspired by IRM, we show that transfer risk serves as a better out-of-distribution generalization criterion, both theoretically and empirically. We further demonstrate the impact of optimizing such transfer risk on two controlled settings, each representing a different pattern of environment shift, as well as on two real-world datasets. Experimentally, the approach outperforms baselines across various out-of-distribution generalization tasks. Code is available at <https://github.com/Newbeeer/TRM>.
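To make the idea of "applying an optimal predictor trained on one environment to another" concrete, here is a minimal NumPy sketch on toy data. It is not the authors' TRM implementation (see the repository linked above for that). The function names, the squared loss, the closed-form least-squares predictors, and the use of the worst off-diagonal entry as the summary are all assumptions made for illustration, and the gradient-matching term is not modeled here.

```python
import numpy as np

def fit_predictor(feats, y, ridge=1e-6):
    """Closed-form least-squares predictor on fixed features (tiny ridge for stability)."""
    d = feats.shape[1]
    return np.linalg.solve(feats.T @ feats + ridge * np.eye(d), feats.T @ y)

def cross_env_risks(envs, phi):
    """risks[i, j] = mean squared error on environment j of the predictor fit on environment i."""
    feats = [(phi(x), y) for x, y in envs]
    weights = [fit_predictor(f, y) for f, y in feats]
    n = len(envs)
    risks = np.zeros((n, n))
    for i, w in enumerate(weights):
        for j, (f, y) in enumerate(feats):
            risks[i, j] = np.mean((f @ w - y) ** 2)
    return risks

def worst_case_transfer(risks):
    """Largest off-diagonal entry: worst risk when a predictor is moved to a different environment."""
    off_diag = ~np.eye(risks.shape[0], dtype=bool)
    return risks[off_diag].max()

def make_env(rng, sign, n=2000):
    """Toy environment (hypothetical): x1 drives y in every environment, x2 tracks y with an environment-specific sign."""
    x1 = rng.normal(size=n)
    y = x1 + 0.5 * rng.normal(size=n)
    x2 = sign * y + 0.1 * rng.normal(size=n)
    return np.stack([x1, x2], axis=1), y

rng = np.random.default_rng(0)
envs = [make_env(rng, +1.0), make_env(rng, -1.0)]

phi_all = lambda x: x          # representation that keeps the spurious feature x2
phi_inv = lambda x: x[:, :1]   # representation that keeps only the stable feature x1

risk_all = worst_case_transfer(cross_env_risks(envs, phi_all))
risk_inv = worst_case_transfer(cross_env_risks(envs, phi_inv))
```

In this toy setup, the representation that keeps the sign-flipping feature yields predictors that transfer poorly, so `risk_all` comes out larger than `risk_inv`. A criterion that penalizes cross-environment risk would therefore prefer `phi_inv`.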

