Chroma-VAE: Mitigating Shortcut Learning with Generative Classifiers

11/28/2022
by   Wanqian Yang, et al.

Deep neural networks are susceptible to shortcut learning, using simple features to achieve low training loss without discovering essential semantic structure. Contrary to prior belief, we show that generative models alone are not sufficient to prevent shortcut learning, despite an incentive to recover a more comprehensive representation of the data than discriminative approaches. However, we observe that shortcuts are preferentially encoded with minimal information, a fact that generative models can exploit to mitigate shortcut learning. In particular, we propose Chroma-VAE, a two-pronged approach where a VAE classifier is initially trained to isolate the shortcut in a small latent subspace, allowing a secondary classifier to be trained on the complementary, shortcut-free latent subspace. In addition to demonstrating the efficacy of Chroma-VAE on benchmark and real-world shortcut learning tasks, our work highlights the potential for manipulating the latent space of generative classifiers to isolate or interpret specific correlations.
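The latent-space partition described above can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes, purely for illustration, that the small shortcut subspace is the first `shortcut_dim` coordinates of each latent vector and that the secondary classifier sees only the remaining coordinates. The names `split_latent`, `secondary_features`, and `shortcut_dim` are hypothetical, and the latent values are made-up stand-ins for encoder outputs.

```python
from typing import List, Sequence, Tuple


def split_latent(z: Sequence[float], shortcut_dim: int) -> Tuple[List[float], List[float]]:
    """Partition a latent vector into (shortcut subspace, complementary subspace).

    Assumption: the shortcut subspace is the first `shortcut_dim` coordinates.
    """
    if not 0 < shortcut_dim < len(z):
        raise ValueError("shortcut_dim must leave both subspaces non-empty")
    return list(z[:shortcut_dim]), list(z[shortcut_dim:])


def secondary_features(latents: Sequence[Sequence[float]], shortcut_dim: int) -> List[List[float]]:
    """Keep only the complementary coordinates, as inputs for a secondary classifier."""
    return [split_latent(z, shortcut_dim)[1] for z in latents]


# Toy latents standing in for VAE encoder outputs (made-up numbers).
latents = [[0.9, -0.2, 0.4, 1.1], [-1.3, 0.5, 0.0, -0.7]]

z_shortcut, z_rest = split_latent(latents[0], shortcut_dim=1)
features = secondary_features(latents, shortcut_dim=1)
```

In this sketch the first-stage classifier head would read only `z_shortcut` during VAE training, while any standard classifier could then be fit on `features`. How large the shortcut subspace should be is left open here, since the abstract does not specify it.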


