Domain Generalization Needs Stochastic Weight Averaging for Robustness on Domain Shifts

02/17/2021
by Junbum Cha, et al.

Domain generalization aims to learn, from multiple source domains, a model that generalizes to unseen target domains. Various approaches have been proposed to address this problem. However, recent benchmarks show that most of them do not provide significant improvements over simple empirical risk minimization (ERM) in practical cases. In this paper, we analyze how ERM works in terms of domain-invariant feature learning and domain-specific gradient normalization. In addition, we observe that ERM converges to a loss valley shared across multiple training domains, and we gain the insight that the center of the valley generalizes better. To estimate the center, we employ stochastic weight averaging (SWA) and provide a theoretical analysis describing how SWA supports the generalization bound for an unseen domain. As a result, we achieve state-of-the-art performance on all of the widely used domain generalization benchmarks, namely PACS, VLCS, OfficeHome, TerraIncognita, and DomainNet, by large margins. Further analysis reveals how SWA operates on domain generalization tasks.
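The weight-averaging idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the quadratic toy loss, the noise model, and the names `swa_update`, `toy_gradient`, and `train_with_swa` are all assumptions made for illustration. It only shows the general mechanism of keeping a running equal-weight average of the weights visited by SGD after a warm-up period.

```python
import random


def swa_update(avg_weights, new_weights, num_averaged):
    """Fold one more weight snapshot into a running equal-weight average.

    `num_averaged` is the number of snapshots already included in `avg_weights`.
    """
    return [
        avg + (new - avg) / (num_averaged + 1)
        for avg, new in zip(avg_weights, new_weights)
    ]


def toy_gradient(weights, rng):
    # Hypothetical stand-in for a minibatch gradient: the exact gradient of
    # the toy loss 0.5 * ||w||^2, plus Gaussian noise.
    return [w + rng.gauss(0.0, 1.0) for w in weights]


def train_with_swa(steps=2000, swa_start=500, lr=0.1, seed=0):
    """Run noisy SGD on the toy loss and average the weights after `swa_start`."""
    rng = random.Random(seed)
    weights = [5.0, -3.0]  # arbitrary starting point (assumption)
    swa_weights = None
    num_averaged = 0

    for step in range(steps):
        grad = toy_gradient(weights, rng)
        weights = [w - lr * g for w, g in zip(weights, grad)]

        # After the warm-up, every visited iterate joins the average.
        if step >= swa_start:
            if swa_weights is None:
                swa_weights = list(weights)
            else:
                swa_weights = swa_update(swa_weights, weights, num_averaged)
            num_averaged += 1

    return weights, swa_weights


if __name__ == "__main__":
    last_iterate, averaged = train_with_swa()
    print("last SGD iterate:", last_iterate)
    print("averaged weights:", averaged)
```

In this toy setting the minimum of the loss sits at the origin, so the averaged weights can be compared with the last noisy iterate by their distance from zero. In a deep-learning setting, the same running average would be kept over model parameters rather than a two-element list.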

research
08/03/2022

Adaptive Domain Generalization via Online Disagreement Minimization

Deep neural networks suffer from significant performance deterioration w...
research
05/26/2023

Domain Aligned Prefix Averaging for Domain Generalization in Abstractive Summarization

Domain generalization is hitherto an underexplored area applied in abstr...
research
09/29/2022

Learning Gradient-based Mixup towards Flatter Minima for Domain Generalization

To address the distribution shifts between training and test data, domai...
research
02/28/2023

DART: Diversify-Aggregate-Repeat Training Improves Generalization of Neural Networks

Generalization of neural networks is crucial for deploying them safely i...
research
08/02/2022

Implicit Semantic Augmentation for Distance Metric Learning in Domain Generalization

Domain generalization (DG) aims to learn a model on one or more differen...
research
07/25/2019

Domain Generalization via Multidomain Discriminant Analysis

Domain generalization (DG) aims to incorporate knowledge from multiple s...
research
10/02/2021

Domain-Specific Bias Filtering for Single Labeled Domain Generalization

Domain generalization (DG) utilizes multiple labeled source datasets to ...
