Implicit Counterfactual Data Augmentation for Deep Neural Networks

04/26/2023
by   Xiaoling Zhou, et al.
0

Machine-learning models are prone to capturing the spurious correlations between non-causal attributes and classes, with counterfactual data augmentation being a promising direction for breaking these spurious associations. However, explicitly generating counterfactual data is challenging, with the training efficiency declining. Therefore, this study proposes an implicit counterfactual data augmentation (ICDA) method to remove spurious correlations and make stable predictions. Specifically, first, a novel sample-wise augmentation strategy is developed that generates semantically and counterfactually meaningful deep features with distinct augmentation strength for each sample. Second, we derive an easy-to-compute surrogate loss on the augmented feature set when the number of augmented samples becomes infinite. Third, two concrete schemes are proposed, including direct quantification and meta-learning, to derive the key parameters for the robust loss. In addition, ICDA is explained from a regularization aspect, with extensive experiments indicating that our method consistently improves the generalization performance of popular depth networks on multiple typical learning scenarios that require out-of-distribution generalization.

READ FULL TEXT

page 1

page 10

page 11

page 15

research
09/12/2022

Bias Challenges in Counterfactual Data Augmentation

Deep learning models tend not to be out-of-distribution robust primarily...
research
07/21/2020

Regularizing Deep Networks with Semantic Data Augmentation

Data augmentation is widely known as a simple yet surprisingly effective...
research
05/29/2023

Rethinking Counterfactual Data Augmentation Under Confounding

Counterfactual data augmentation has recently emerged as a method to mit...
research
10/14/2020

Data Augmentation for Meta-Learning

Conventional image classifiers are trained by randomly sampling mini-bat...
research
10/25/2022

Learning to Augment via Implicit Differentiation for Domain Generalization

Machine learning models are intrinsically vulnerable to domain shift bet...
research
07/06/2020

Counterfactual Data Augmentation using Locally Factored Dynamics

Many dynamic processes, including common scenarios in robotic control an...
research
10/19/2022

Data-Augmented Counterfactual Learning for Bundle Recommendation

Bundle Recommendation (BR) aims at recommending bundled items on online ...

Please sign up or login with your details

Forgot password? Click here to reset