Circumventing Concept Erasure Methods For Text-to-Image Generative Models

08/03/2023
by Minh Pham, et al.

Text-to-image generative models can produce photo-realistic images for an extremely broad range of concepts, and their usage has proliferated widely among the general public. On the flip side, these models have numerous drawbacks, including their potential to generate images featuring sexually explicit content, mirror artistic styles without permission, or even hallucinate (or deepfake) the likenesses of celebrities. Consequently, various methods have been proposed in order to "erase" sensitive concepts from text-to-image models. In this work, we examine five recently proposed concept erasure methods, and show that targeted concepts are not fully excised by any of these methods. Specifically, we leverage the existence of special learned word embeddings that can retrieve "erased" concepts from the sanitized models with no alterations to their weights. Our results highlight the brittleness of post hoc concept erasure methods, and call into question their use in the algorithmic toolkit for AI safety.
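
The idea of recovering a concept through a learned input embedding, while the model weights stay untouched, can be illustrated with a toy sketch. This is not the paper's implementation: the "model" here is a small fixed linear map, and every name (`FROZEN_WEIGHTS`, `TARGET`, `invert_embedding`) is hypothetical. It only shows the general pattern of optimizing an input vector against a frozen model.

```python
import copy

def forward(weights, embedding):
    """Apply the frozen linear 'model' to an input embedding."""
    return [sum(w * e for w, e in zip(row, embedding)) for row in weights]

def loss(weights, embedding, target):
    """Half squared error between the model output and the target."""
    out = forward(weights, embedding)
    return 0.5 * sum((o - t) ** 2 for o, t in zip(out, target))

def invert_embedding(weights, target, dim, steps=500, lr=0.1):
    """Gradient descent on the embedding only; weights are never written."""
    embedding = [0.0] * dim
    for _ in range(steps):
        residual = [o - t for o, t in zip(forward(weights, embedding), target)]
        # Gradient of the loss w.r.t. the embedding: W^T (W e - y)
        grad = [
            sum(weights[i][j] * residual[i] for i in range(len(weights)))
            for j in range(dim)
        ]
        embedding = [e - lr * g for e, g in zip(embedding, grad)]
    return embedding

# Hypothetical stand-ins: a frozen "sanitized" model and a target output
# representing the concept we want the model to reproduce.
FROZEN_WEIGHTS = [[1.0, 0.5, -0.5], [0.0, 1.0, 0.5]]
TARGET = [1.0, -2.0]

weights_before = copy.deepcopy(FROZEN_WEIGHTS)
learned = invert_embedding(FROZEN_WEIGHTS, TARGET, dim=3)
final_loss = loss(FROZEN_WEIGHTS, learned, TARGET)
```

In this toy setting the search succeeds because the target is still reachable through the frozen map. The abstract's claim is analogous: the erased concepts remain reachable through suitable input embeddings, even though the sanitized weights are not changed.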


research
06/08/2023

Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models

Text-to-image generative models have enabled high-resolution image synth...
research
03/30/2023

Forget-Me-Not: Learning to Forget in Text-to-Image Diffusion Models

The unlearning problem of deep learning models, once primarily an academ...
research
09/08/2023

Create Your World: Lifelong Text-to-Image Diffusion

Text-to-image generative models can produce diverse high-quality images ...
research
06/09/2023

Safety and Fairness for Content Moderation in Generative Models

With significant advances in generative AI, new technologies are rapidly...
research
12/08/2022

Multi-Concept Customization of Text-to-Image Diffusion

While generative models produce high-quality images of concepts learned ...
research
01/11/2023

ChatGPT is not all you need. A State of the Art Review of large Generative AI models

During the last two years there has been a plethora of large generative ...
research
09/14/2021

Design Guidelines for Prompt Engineering Text-to-Image Generative Models

Text-to-image generative models are a new and powerful way to generate v...
