Understanding and Mitigating Copying in Diffusion Models

05/31/2023
by   Gowthami Somepalli, et al.
0

Images generated by diffusion models like Stable Diffusion are increasingly widespread. Recent works and even lawsuits have shown that these models are prone to replicating their training data, unbeknownst to the user. In this paper, we first analyze this memorization problem in text-to-image diffusion models. While it is widely believed that duplicated images in the training set are responsible for content replication at inference time, we observe that the text conditioning of the model plays a similarly important role. In fact, we see in our experiments that data replication often does not happen for unconditional models, while it is common in the text-conditional case. Motivated by our findings, we then propose several techniques for reducing data replication at both training and inference time by randomizing and augmenting image captions in the training set.

READ FULL TEXT

page 2

page 6

page 9

page 12

page 13

page 15

page 17

research
12/07/2022

Diffusion Art or Digital Forgery? Investigating Data Replication in Diffusion Models

Cutting-edge diffusion models produce images with high quality and custo...
research
09/13/2023

Mitigate Replication and Copying in Diffusion Models with Generalized Caption and Dual Fusion Enhancement

While diffusion models demonstrate a remarkable capability for generatin...
research
06/02/2023

Conditional Generation from Unconditional Diffusion Models using Denoiser Representations

Denoising diffusion models have gained popularity as a generative modeli...
research
11/23/2022

Improving dermatology classifiers across populations using images generated by large diffusion models

Dermatological classification algorithms developed without sufficiently ...
research
02/23/2022

When do GANs replicate? On the choice of dataset size

Do GANs replicate training images? Previous studies have shown that GANs...
research
06/09/2023

Boosting GUI Prototyping with Diffusion Models

GUI (graphical user interface) prototyping is a widely-used technique in...
research
08/02/2023

Reverse Stable Diffusion: What prompt was used to generate this image?

Text-to-image diffusion models such as Stable Diffusion have recently at...

Please sign up or login with your details

Forgot password? Click here to reset