Forget-Me-Not: Learning to Forget in Text-to-Image Diffusion Models

03/30/2023
by   Eric Zhang, et al.
0

The unlearning problem of deep learning models, once primarily an academic concern, has become a prevalent issue in the industry. The significant advances in text-to-image generation techniques have prompted global discussions on privacy, copyright, and safety, as numerous unauthorized personal IDs, content, artistic creations, and potentially harmful materials have been learned by these models and later utilized to generate and distribute uncontrolled content. To address this challenge, we propose \textbf{Forget-Me-Not}, an efficient and low-cost solution designed to safely remove specified IDs, objects, or styles from a well-configured text-to-image model in as little as 30 seconds, without impairing its ability to generate other content. Alongside our method, we introduce the \textbf{Memorization Score (M-Score)} and \textbf{ConceptBench} to measure the models' capacity to generate general concepts, grouped into three primary categories: ID, object, and style. Using M-Score and ConceptBench, we demonstrate that Forget-Me-Not can effectively eliminate targeted concepts while maintaining the model's performance on other concepts. Furthermore, Forget-Me-Not offers two practical extensions: a) removal of potentially harmful or NSFW content, and b) enhancement of model accuracy, inclusion and diversity through \textbf{concept correction and disentanglement}. It can also be adapted as a lightweight model patch for Stable Diffusion, allowing for concept manipulation and convenient distribution. To encourage future research in this critical area and promote the development of safe and inclusive generative models, we will open-source our code and ConceptBench at \href{https://github.com/SHI-Labs/Forget-Me-Not}{https://github.com/SHI-Labs/Forget-Me-Not}.

READ FULL TEXT

page 1

page 6

page 7

page 8

page 9

page 10

research
02/05/2023

Divide and Compose with Score Based Generative Models

While score based generative models, or diffusion models, have found suc...
research
07/12/2023

Towards Safe Self-Distillation of Internet-Scale Text-to-Image Diffusion Models

Large-scale image generation models, with impressive quality made possib...
research
08/03/2023

Circumventing Concept Erasure Methods For Text-to-Image Generative Models

Text-to-image generative models can produce photo-realistic images for a...
research
03/17/2023

A Recipe for Watermarking Diffusion Models

Recently, diffusion models (DMs) have demonstrated their advantageous po...
research
03/13/2023

Erasing Concepts from Diffusion Models

Motivated by recent advancements in text-to-image diffusion, we study er...
research
03/27/2023

Anti-DreamBooth: Protecting users from personalized text-to-image synthesis

Text-to-image diffusion models are nothing but a revolution, allowing an...
research
08/03/2023

ConceptLab: Creative Generation using Diffusion Prior Constraints

Recent text-to-image generative models have enabled us to transform our ...

Please sign up or login with your details

Forgot password? Click here to reset