SAND-mask: An Enhanced Gradient Masking Strategy for the Discovery of Invariances in Domain Generalization

06/04/2021
by   Soroosh Shahtalebi, et al.
8

A major bottleneck in the real-world applications of machine learning models is their failure in generalizing to unseen domains whose data distribution is not i.i.d to the training domains. This failure often stems from learning non-generalizable features in the training domains that are spuriously correlated with the label of data. To address this shortcoming, there has been a growing surge of interest in learning good explanations that are hard to vary, which is studied under the notion of Out-of-Distribution (OOD) Generalization. The search for good explanations that are invariant across different domains can be seen as finding local (global) minimas in the loss landscape that hold true across all of the training domains. In this paper, we propose a masking strategy, which determines a continuous weight based on the agreement of gradients that flow in each edge of network, in order to control the amount of update received by the edge in each step of optimization. Particularly, our proposed technique referred to as "Smoothed-AND (SAND)-masking", not only validates the agreement in the direction of gradients but also promotes the agreement among their magnitudes to further ensure the discovery of invariances across training domains. SAND-mask is validated over the Domainbed benchmark for domain generalization and significantly improves the state-of-the-art accuracy on the Colored MNIST dataset while providing competitive results on other domain generalization datasets.

READ FULL TEXT
research
08/03/2021

Domain Generalization via Gradient Surgery

In real-life applications, machine learning models often face scenarios ...
research
09/07/2021

Fishr: Invariant Gradient Variances for Out-of-distribution Generalization

Learning robust models that generalize well under changes in the data di...
research
04/20/2021

Gradient Matching for Domain Generalization

Machine learning systems typically assume that the distributions of trai...
research
05/02/2023

PGrad: Learning Principal Gradients For Domain Generalization

Machine learning models fail to perform when facing out-of-distribution ...
research
02/17/2021

Robust Domain-Free Domain Generalization with Class-aware Alignment

While deep neural networks demonstrate state-of-the-art performance on a...
research
04/05/2023

Domain Generalization with Adversarial Intensity Attack for Medical Image Segmentation

Most statistical learning algorithms rely on an over-simplified assumpti...
research
09/01/2020

Learning explanations that are hard to vary

In this paper, we investigate the principle that `good explanations are ...

Please sign up or login with your details

Forgot password? Click here to reset