PGrad: Learning Principal Gradients For Domain Generalization

05/02/2023
by Zhe Wang, et al.

Machine learning models often fail to perform well on out-of-distribution (OOD) domains; generalizing to such unseen domains is the challenging task known as domain generalization (DG). In this work, we develop a novel DG training strategy, which we call PGrad, that learns a robust gradient direction and thereby improves models' generalization ability on unseen domains. The proposed gradient aggregates the principal directions of a sampled roll-out optimization trajectory that measures the training dynamics across all training domains. PGrad's gradient design forces DG training to ignore domain-dependent noise signals and updates all training domains with a robust direction covering the main components of the parameter dynamics. We further improve PGrad via bijection-based computational refinement and directional and length-based calibrations. Our theoretical proof connects PGrad to the spectral analysis of the Hessian in training neural networks. Experiments on the DomainBed and WILDS benchmarks demonstrate that our approach effectively enables robust DG optimization and leads to smoothly decreasing loss curves. Empirically, PGrad achieves competitive results across seven datasets, demonstrating its efficacy under both synthetic and real-world distributional shifts. Code is available at https://github.com/QData/PGrad.
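To make the idea in the abstract concrete, the following is a minimal NumPy sketch of one way a "principal gradient" could be computed from a roll-out trajectory. It is not the authors' implementation (see the linked repository for that). The function names `rollout_trajectory` and `principal_gradient`, the toy least-squares loss, the singular-value weighting, and the exact form of the sign and length calibrations are all assumptions made for illustration.

```python
import numpy as np


def rollout_trajectory(theta, domains, lr=0.05):
    """Take one gradient step per training domain, in sequence,
    and record the parameters after every step.

    `domains` is a list of (X, y) pairs; the loss is a toy
    least-squares objective (an assumption for this sketch).
    """
    current = theta.copy()
    trajectory = [current.copy()]
    for X, y in domains:
        grad = 2.0 * X.T @ (X @ current - y) / len(y)
        current = current - lr * grad
        trajectory.append(current.copy())
    return np.stack(trajectory)  # shape: (n_domains + 1, dim)


def principal_gradient(trajectory):
    """Aggregate the principal directions of a parameter trajectory
    into a single update direction (hypothetical aggregation rule)."""
    # Overall displacement of the roll-out, used for both calibrations.
    net = trajectory[-1] - trajectory[0]

    # Principal directions of the centered trajectory via SVD.
    centered = trajectory - trajectory.mean(axis=0, keepdims=True)
    _, sing_vals, vt = np.linalg.svd(centered, full_matrices=False)

    direction = np.zeros_like(net)
    total = sing_vals.sum()
    if total == 0.0:
        return direction  # the trajectory did not move

    for sigma, v in zip(sing_vals, vt):
        # Directional calibration (assumed form): flip each principal
        # direction so it does not oppose the net displacement.
        sign = 1.0 if v @ net >= 0.0 else -1.0
        # Weight each direction by its share of the singular values.
        direction += (sigma / total) * sign * v

    # Length calibration (assumed form): match the net displacement norm.
    norm = np.linalg.norm(direction)
    if norm > 0.0:
        direction *= np.linalg.norm(net) / norm
    return direction


# Usage on three synthetic "domains" that differ by an input shift.
rng = np.random.default_rng(0)
w_true = rng.normal(size=5)
domains = []
for shift in (0.0, 0.5, 1.0):
    X = rng.normal(loc=shift, size=(32, 5))
    y = X @ w_true + 0.1 * rng.normal(size=32)
    domains.append((X, y))

theta0 = np.zeros(5)
traj = rollout_trajectory(theta0, domains)
update = principal_gradient(traj)
theta1 = theta0 + update  # apply the aggregated direction as one update
```

In this sketch the low-weight directions, which capture step-to-step variation between domains, contribute little to the update, which is the intuition behind ignoring domain-dependent noise. The actual weighting, calibration, and bijection-based refinement used by PGrad may differ from what is shown here.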


research
04/20/2021

Gradient Matching for Domain Generalization

Machine learning systems typically assume that the distributions of trai...
research
08/22/2023

Understanding Hessian Alignment for Domain Generalization

Out-of-distribution (OOD) generalization is a critical ability for deep ...
research
09/07/2021

Fishr: Invariant Gradient Variances for Out-of-distribution Generalization

Learning robust models that generalize well under changes in the data di...
research
06/04/2021

SAND-mask: An Enhanced Gradient Masking Strategy for the Discovery of Invariances in Domain Generalization

A major bottleneck in the real-world applications of machine learning mo...
research
08/20/2023

DomainDrop: Suppressing Domain-Sensitive Channels for Domain Generalization

Deep Neural Networks have exhibited considerable success in various visu...
research
10/13/2022

Improving Out-of-Distribution Generalization by Adversarial Training with Structured Priors

Deep models often fail to generalize well in test domains when the data ...
research
08/22/2023

Domain Generalization via Rationale Invariance

This paper offers a new perspective to ease the challenge of domain gene...
