Domain Generalization by Mutual-Information Regularization with Pre-trained Models

03/21/2022
by Junbum Cha, et al.

Domain generalization (DG) aims to learn a model that generalizes to an unseen target domain using only a limited set of source domains. Previous approaches to DG fail to learn domain-invariant representations from the source domains alone because of the significant domain shifts between training and test domains. Instead, we reformulate the DG objective using mutual information with the oracle model, a model that generalizes to any possible domain. We derive a tractable variational lower bound by approximating the oracle model with a pre-trained model; we call the resulting method Mutual Information Regularization with Oracle (MIRO). Our extensive experiments show that MIRO significantly improves out-of-distribution performance. Furthermore, our scaling experiments show that the larger the pre-trained model, the greater the performance improvement from MIRO. Source code is available at https://github.com/kakaobrain/miro.
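
To make the idea of a variational lower bound on mutual information concrete, the following is a minimal sketch and not the authors' implementation (see the linked repository for that). It assumes a Gaussian variational distribution with diagonal covariance, under which maximizing the bound on the mutual information between pre-trained features and the features being learned reduces, up to additive constants, to minimizing a log-determinant term plus a variance-weighted squared error. The function name, the linear mean encoder `mean_w`, and the parameter `log_var` are hypothetical names chosen for illustration.

```python
import numpy as np

def gaussian_mi_regularizer(z, z_ref, mean_w, log_var):
    """Negative Gaussian variational lower bound on I(z_ref; z), up to constants.

    z       : (n, d) features from the model being trained
    z_ref   : (n, d) features from a frozen pre-trained model (oracle stand-in)
    mean_w  : (d, d) hypothetical linear mean encoder, mu(z) = z @ mean_w
    log_var : (d,)   hypothetical log-variances of a diagonal covariance
    """
    mu = z @ mean_w                       # predicted mean of z_ref given z
    var = np.exp(log_var)                 # diagonal covariance entries
    log_det = np.sum(log_var)             # log-determinant of the diagonal covariance
    sq_err = np.sum((z_ref - mu) ** 2 / var, axis=1)  # variance-weighted squared error
    return 0.5 * (log_det + np.mean(sq_err))

# Toy check: identical features with an identity encoder give zero penalty,
# while a constant offset of 1 in each of 3 dimensions gives 0.5 * 3 = 1.5.
z = np.ones((4, 3))
same = gaussian_mi_regularizer(z, z, np.eye(3), np.zeros(3))
shifted = gaussian_mi_regularizer(z, z + 1.0, np.eye(3), np.zeros(3))
```

In training, a term like this would typically be added to the task loss with a weighting coefficient (an assumption here), so that the learned features stay predictive of the pre-trained ones while still fitting the source domains.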


