Not Just Pretty Pictures: Text-to-Image Generators Enable Interpretable Interventions for Robust Representations

12/21/2022
by   Jianhao Yuan, et al.
0

Neural image classifiers are known to undergo severe performance degradation when exposed to input that exhibits covariate-shift with respect to the training distribution. Successful hand-crafted augmentation pipelines aim at either approximating the expected test domain conditions or to perturb the features that are specific to the training environment. The development of effective pipelines is typically cumbersome, and produce transformations whose impact on the classifier performance are hard to understand and control. In this paper, we show that recent Text-to-Image (T2I) generators' ability to simulate image interventions via natural-language prompts can be leveraged to train more robust models, offering a more interpretable and controllable alternative to traditional augmentation methods. We find that a variety of prompting mechanisms are effective for producing synthetic training data sufficient to achieve state-of-the-art performance in widely-adopted domain-generalization benchmarks and reduce classifiers' dependency on spurious features. Our work suggests that further progress in T2I generation and a tighter integration with other research fields may represent a significant step towards the development of more robust machine learning systems.

READ FULL TEXT

page 1

page 4

page 21

page 22

page 23

research
12/06/2022

A Learning Based Hypothesis Test for Harmful Covariate Shift

The ability to quickly and accurately identify covariate shift at test t...
research
10/09/2020

Counterfactually-Augmented SNLI Training Data Does Not Yield Better Generalization Than Unaugmented Data

A growing body of work shows that models exploit annotation artifacts to...
research
08/19/2022

Text to Image Generation: Leaving no Language Behind

One of the latest applications of Artificial Intelligence (AI) is to gen...
research
03/26/2021

Data Augmentation in Natural Language Processing: A Novel Text Generation Approach for Long and Short Text Classifiers

In many cases of machine learning, research suggests that the developmen...
research
08/19/2023

ASPIRE: Language-Guided Augmentation for Robust Image Classification

Neural image classifiers can often learn to make predictions by overly r...
research
02/04/2021

Controlling Hallucinations at Word Level in Data-to-Text Generation

Data-to-Text Generation (DTG) is a subfield of Natural Language Generati...
research
09/11/2023

Exploring Minecraft Settlement Generators with Generative Shift Analysis

With growing interest in Procedural Content Generation (PCG) it becomes ...

Please sign up or login with your details

Forgot password? Click here to reset