Gradient Estimation for Unseen Domain Risk Minimization with Pre-Trained Models

02/03/2023
by   Byunggyu Lew, et al.
0

Domain generalization aims to build generalized models that perform well on unseen domains when only source domains are available for model optimization. Recent studies have demonstrated that large-scale pre-trained models could play an important role in domain generalization by providing their generalization power. However, large-scale pre-trained models are not fully equipped with target task-specific knowledge due to a discrepancy between the pre-training objective and the target task. Although the task-specific knowledge could be learned from source domains by fine-tuning, this hurts the generalization power of the pre-trained models because of gradient bias toward the source domains. To address this issue, we propose a new domain generalization method that estimates unobservable gradients that reduce potential risks in unseen domains, using a large-scale pre-trained model. Our proposed method allows the pre-trained model to learn task-specific knowledge further while preserving its generalization ability with the estimated gradients. Experimental results show that our proposed method outperforms baseline methods on DomainBed, a standard benchmark in domain generalization. We also provide extensive analyses to demonstrate that the estimated unobserved gradients relieve the gradient bias, and the pre-trained model learns the task-specific knowledge without sacrificing its generalization power.

READ FULL TEXT

page 2

page 19

research
03/21/2022

Domain Generalization by Mutual-Information Regularization with Pre-trained Models

Domain generalization (DG) aims to learn a generalized model to an unsee...
research
08/23/2022

Bag of Tricks for Out-of-Distribution Generalization

Recently, out-of-distribution (OOD) generalization has attracted attenti...
research
03/05/2023

PyramidFlow: High-Resolution Defect Contrastive Localization using Pyramid Normalizing Flow

During industrial processing, unforeseen defects may arise in products d...
research
08/19/2023

TDG: Text-guided Domain Generalization

Domain generalization (DG) attempts to generalize a model trained on sin...
research
09/18/2023

Multi-modality Meets Re-learning: Mitigating Negative Transfer in Sequential Recommendation

Learning effective recommendation models from sparse user interactions r...
research
04/12/2020

Gradients as Features for Deep Representation Learning

We address the challenging problem of deep representation learning–the e...
research
12/19/2020

Generalize a Small Pre-trained Model to Arbitrarily Large TSP Instances

For the traveling salesman problem (TSP), the existing supervised learni...

Please sign up or login with your details

Forgot password? Click here to reset