ASPIRE: Language-Guided Augmentation for Robust Image Classification

08/19/2023
by   Sreyan Ghosh, et al.

Neural image classifiers often learn to make predictions by relying too heavily on non-predictive features that are spuriously correlated with the class labels in the training data. This leads to poor performance in atypical real-world scenarios where such features are absent. Supplementing the training dataset with images that lack such spurious features can help the classifier generalize better and learn robustly against spurious correlations. This paper presents ASPIRE (Language-guided data Augmentation for SPurIous correlation REmoval), a simple yet effective solution for expanding the training dataset with synthetic images without spurious features. ASPIRE, guided by language, generates these images without requiring any form of additional supervision or existing examples. Specifically, we first employ LLMs to extract foreground and background features from textual descriptions of an image, and then use advanced language-guided image editing to discover the features that are spuriously correlated with the class label. Finally, we personalize a text-to-image generation model to generate diverse in-domain images without spurious features. We demonstrate the effectiveness of ASPIRE on 4 datasets, including the very challenging Hard ImageNet dataset, and with 9 baselines, and show that ASPIRE improves the classification accuracy of prior methods by 1 soon at: https://github.com/Sreyan88/ASPIRE.
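The discovery step described above can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not state the exact criterion, so the sketch assumes that a background feature counts as spurious when editing it out flips the classifier's prediction on otherwise correctly classified examples. Images are replaced by plain sets of feature names, and `find_spurious_features`, `toy_classifier`, `min_flip_rate`, and the example labels are all hypothetical names introduced for illustration.

```python
from collections import Counter
from typing import Callable, FrozenSet, Iterable, List, Tuple

Features = FrozenSet[str]
# (foreground features, background features, class label)
Example = Tuple[Features, Features, str]


def find_spurious_features(
    examples: Iterable[Example],
    classify: Callable[[Features], str],
    min_flip_rate: float = 0.5,  # assumed threshold, not from the paper
) -> List[str]:
    """Flag background features whose removal often flips a correct prediction."""
    seen: Counter = Counter()     # times each background feature was tested
    flipped: Counter = Counter()  # times removing it changed the prediction
    for foreground, background, label in examples:
        if classify(foreground | background) != label:
            continue  # only test examples the classifier already gets right
        for feature in background:
            seen[feature] += 1
            edited = foreground | (background - {feature})  # "edit out" one feature
            if classify(edited) != label:
                flipped[feature] += 1
    return sorted(f for f in seen if flipped[f] / seen[f] >= min_flip_rate)


def toy_classifier(features: Features) -> str:
    # Deliberately biased stand-in: it looks at the background, not the bird.
    return "waterbird" if "water" in features else "landbird"


examples = [
    (frozenset({"duck"}), frozenset({"water", "reeds"}), "waterbird"),
    (frozenset({"gull"}), frozenset({"water", "sky"}), "waterbird"),
    (frozenset({"sparrow"}), frozenset({"grass", "sky"}), "landbird"),
]

spurious = find_spurious_features(examples, toy_classifier)
print(spurious)
```

In this toy setting only the feature the biased classifier depends on is flagged. In a real pipeline the feature sets would come from LLM-parsed captions and the removal would be performed by a language-guided image editor, neither of which is modeled here.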

