ERM++: An Improved Baseline for Domain Generalization

04/04/2023
by   Piotr Teterwak, et al.
0

Multi-source Domain Generalization (DG) measures a classifier's ability to generalize to new distributions of data it was not trained on, given several training domains. While several multi-source DG methods have been proposed, they incur additional complexity during training by using domain labels. Recent work has shown that a well-tuned Empirical Risk Minimization (ERM) training procedure, that is simply minimizing the empirical risk on the source domains, can outperform most existing DG methods. We identify several key candidate techniques to further improve ERM performance, such as better utilization of training data, model parameter selection, and weight-space regularization. We call the resulting method ERM++, and show it significantly improves the performance of DG on five multi-source datasets by over 5 ERM, and beats state-of-the-art despite being less computationally expensive. Additionally, we demonstrate the efficacy of ERM++ on the WILDS-FMOW dataset, a challenging DG benchmark. We hope that ERM++ becomes a strong baseline for future DG research. Code is released at https://github.com/piotr-teterwak/erm_plusplus.

READ FULL TEXT

page 10

page 11

page 12

page 13

page 14

page 15

page 16

research
05/15/2022

Not to Overfit or Underfit? A Study of Domain Generalization in Question Answering

Machine learning models are prone to overfitting their source (training)...
research
02/01/2022

Finding lost DG: Explaining domain generalization via model complexity

The domain generalization (DG) problem setting challenges a model traine...
research
09/07/2021

Fishr: Invariant Gradient Variances for Out-of-distribution Generalization

Learning robust models that generalize well under changes in the data di...
research
10/15/2021

Reappraising Domain Generalization in Neural Networks

Domain generalization (DG) of machine learning algorithms is defined as ...
research
01/31/2019

Episodic Training for Domain Generalization

Domain generalization (DG) is the challenging and topical problem of lea...
research
09/11/2019

Domain Aggregation Networks for Multi-Source Domain Adaptation

In many real-world applications, we want to exploit multiple source data...
research
04/21/2023

Tree-structured Parzen estimator: Understanding its algorithm components and their roles for better empirical performance

Recent advances in many domains require more and more complicated experi...

Please sign up or login with your details

Forgot password? Click here to reset