Training on Thin Air: Improve Image Classification with Generated Data

05/24/2023
by Yongchao Zhou, et al.

Acquiring high-quality data for training discriminative models is a crucial yet challenging aspect of building effective predictive systems. In this paper, we present Diffusion Inversion, a simple yet effective method that leverages a pre-trained generative model, Stable Diffusion, to generate diverse, high-quality training data for image classification. Our approach captures the original data distribution and ensures data coverage by inverting images to the latent space of Stable Diffusion, and it generates diverse novel training images by conditioning the generative model on noisy versions of these latent vectors. We identify three key components that allow our generated images to successfully supplant the original dataset, leading to a 2-3x improvement in sample complexity and a 6.5x decrease in sampling time. Moreover, our approach consistently outperforms generic prompt-based steering methods and a KNN retrieval baseline across a wide range of datasets. Additionally, we demonstrate that our approach is compatible with widely used data augmentation techniques, and that the generated data reliably supports various neural architectures and enhances few-shot learning.
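
As a rough illustration of the perturbation step described in the abstract, the sketch below adds Gaussian noise to a vector standing in for an inverted image embedding. The function names, the noise scale of 0.1, the four-dimensional placeholder vector, and the choice of isotropic Gaussian noise are all assumptions made for illustration; the abstract does not specify them. In the actual method such vectors would be passed to Stable Diffusion as conditioning, which is not shown here.

```python
import random


def perturb_embedding(embedding, noise_scale, rng):
    """Return a copy of `embedding` with i.i.d. Gaussian noise added per coordinate."""
    return [x + noise_scale * rng.gauss(0.0, 1.0) for x in embedding]


def make_conditioning_variants(embedding, num_variants, noise_scale=0.1, seed=0):
    """Build several noisy copies of one embedding (hypothetical helper).

    Each copy would condition one generated image, so a single inverted
    training image can yield several distinct synthetic samples.
    """
    rng = random.Random(seed)  # seeded so the variants are reproducible
    return [perturb_embedding(embedding, noise_scale, rng) for _ in range(num_variants)]


# Placeholder for a vector obtained by inverting one training image (assumed values).
inverted = [0.5, -1.2, 0.3, 0.8]
variants = make_conditioning_variants(inverted, num_variants=3)
```

A larger noise scale would be expected to trade fidelity to the source image for diversity; the value that works in practice is an empirical question this sketch does not answer.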


