HMOE: Hypernetwork-based Mixture of Experts for Domain Generalization

11/15/2022
by   Jingang Qu, et al.
0

Due to the domain shift, machine learning systems typically fail to generalize well to domains different from those of training data, which is the problem that domain generalization (DG) aims to address. However, most mainstream DG algorithms lack interpretability and require domain labels, which are not available in many real-world scenarios. In this work, we propose a novel DG method, HMOE: Hypernetwork-based Mixture of Experts (MoE), that does not require domain labels and is more interpretable. We use hypernetworks to generate the weights of experts, allowing experts to share some useful meta-knowledge. MoE has proven adept at detecting and identifying heterogeneous patterns in data. For DG, heterogeneity exactly arises from the domain shift. We compare HMOE with other DG algorithms under a fair and unified benchmark-DomainBed. Extensive experiments show that HMOE can perform latent domain discovery from data of mixed domains and divide it into distinct clusters that are surprisingly more consistent with human intuition than original domain labels. Compared to other DG methods, HMOE shows competitive performance and achieves SOTA results in some cases without using domain labels.

READ FULL TEXT

page 8

page 13

research
05/25/2023

Quantitatively Measuring and Contrastively Exploring Heterogeneity for Domain Generalization

Domain generalization (DG) is a prevalent problem in real-world applicat...
research
11/18/2019

Domain Generalization Using a Mixture of Multiple Latent Domains

When domains, which represent underlying data distributions, vary during...
research
08/03/2022

Equivariant Disentangled Transformation for Domain Generalization under Combination Shift

Machine learning systems may encounter unexpected problems when the data...
research
10/08/2022

Meta-DMoE: Adapting to Domain Shift by Meta-Distillation from Mixture-of-Experts

In this paper, we tackle the problem of domain shift. Most existing meth...
research
06/28/2022

Domain Agnostic Few-shot Learning for Speaker Verification

Deep learning models for verification systems often fail to generalize t...
research
04/20/2021

Gradient Matching for Domain Generalization

Machine learning systems typically assume that the distributions of trai...
research
04/16/2021

Deep Stable Learning for Out-Of-Distribution Generalization

Approaches based on deep neural networks have achieved striking performa...

Please sign up or login with your details

Forgot password? Click here to reset