INDIGO: Intrinsic Multimodality for Domain Generalization

06/13/2022
by   Puneet Mangla, et al.
0

For models to generalize under unseen domains (a.k.a domain generalization), it is crucial to learn feature representations that are domain-agnostic and capture the underlying semantics that makes up an object category. Recent advances towards weakly supervised vision-language models that learn holistic representations from cheap weakly supervised noisy text annotations have shown their ability on semantic understanding by capturing object characteristics that generalize under different domains. However, when multiple source domains are involved, the cost of curating textual annotations for every image in the dataset can blow up several times, depending on their number. This makes the process tedious and infeasible, hindering us from directly using these supervised vision-language approaches to achieve the best generalization on an unseen domain. Motivated from this, we study how multimodal information from existing pre-trained multimodal networks can be leveraged in an "intrinsic" way to make systems generalize under unseen domains. To this end, we propose IntriNsic multimodality for DomaIn GeneralizatiOn (INDIGO), a simple and elegant way of leveraging the intrinsic modality present in these pre-trained multimodal networks along with the visual modality to enhance generalization to unseen domains at test-time. We experiment on several Domain Generalization settings (ClosedDG, OpenDG, and Limited sources) and show state-of-the-art generalization performance on unseen domains. Further, we provide a thorough analysis to develop a holistic understanding of INDIGO.

READ FULL TEXT
research
07/15/2021

Context-Conditional Adaptation for Recognizing Unseen Classes in Unseen Domains

Recent progress towards designing models that can generalize to unseen d...
research
08/28/2020

Learning to Balance Specificity and Invariance for In and Out of Domain Generalization

We introduce Domain-specific Masks for Generalization, a model for impro...
research
08/19/2023

TDG: Text-guided Domain Generalization

Domain generalization (DG) attempts to generalize a model trained on sin...
research
01/23/2021

Hierarchical Domain Invariant Variational Auto-Encoding with weak domain supervision

We address the task of domain generalization, where the goal is to train...
research
12/08/2022

Learning Domain Invariant Prompt for Vision-Language Models

Prompt learning is one of the most effective and trending ways to adapt ...
research
06/27/2023

A Weakly Supervised Classifier and Dataset of White Supremacist Language

We present a dataset and classifier for detecting the language of white ...
research
10/24/2022

Unsupervised Term Extraction for Highly Technical Domains

Term extraction is an information extraction task at the root of knowled...

Please sign up or login with your details

Forgot password? Click here to reset