Improved OOD Generalization via Conditional Invariant Regularizer

07/14/2022
by   Mingyang Yi, et al.
0

Recently, generalization on out-of-distribution (OOD) data with correlation shift has attracted great attention. The correlation shift is caused by the spurious attributes that correlate to the class label, as the correlation between them may vary in training and test data. For such a problem, we show that given the class label, the conditionally independent models of spurious attributes are OOD generalizable. Based on this, a metric Conditional Spurious Variation (CSV) which controls OOD generalization error, is proposed to measure such conditional independence. To improve the OOD generalization, we regularize the training process with the proposed CSV. Under mild assumptions, our training objective can be formulated as a nonconvex-concave mini-max problem. An algorithm with provable convergence rate is proposed to solve the problem. Extensive empirical results verify our algorithm's efficacy in improving OOD generalization.

READ FULL TEXT

page 2

page 32

research
12/17/2020

DecAug: Out-of-Distribution Generalization via Decomposed Feature Representation and Semantic Augmentation

While deep learning demonstrates its strong ability to handle independen...
research
03/15/2019

On Target Shift in Adversarial Domain Adaptation

Discrepancy between training and testing domains is a fundamental proble...
research
07/31/2021

Conditional Bures Metric for Domain Adaptation

As a vital problem in classification-oriented transfer, unsupervised dom...
research
12/18/2022

On the Connection between Invariant Learning and Adversarial Training for Out-of-Distribution Generalization

Despite impressive success in many tasks, deep learning models are shown...
research
12/29/2021

Disentanglement and Generalization Under Correlation Shifts

Correlations between factors of variation are prevalent in real-world da...
research
07/26/2019

Improving Generalization via Attribute Selection on Out-of-the-box Data

Zero-shot learning (ZSL) aims to recognize unseen objects (test classes)...
research
01/28/2022

Understanding Why Generalized Reweighting Does Not Improve Over ERM

Empirical risk minimization (ERM) is known in practice to be non-robust ...

Please sign up or login with your details

Forgot password? Click here to reset