AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning

07/10/2023
by   Yuwei Guo, et al.
0

With the advance of text-to-image models (e.g., Stable Diffusion) and corresponding personalization techniques such as DreamBooth and LoRA, everyone can manifest their imagination into high-quality images at an affordable cost. Subsequently, there is a great demand for image animation techniques to further combine generated static images with motion dynamics. In this report, we propose a practical framework to animate most of the existing personalized text-to-image models once and for all, saving efforts in model-specific tuning. At the core of the proposed framework is to insert a newly initialized motion modeling module into the frozen text-to-image model and train it on video clips to distill reasonable motion priors. Once trained, by simply injecting this motion modeling module, all personalized versions derived from the same base T2I readily become text-driven models that produce diverse and personalized animated images. We conduct our evaluation on several public representative personalized text-to-image models across anime pictures and realistic photographs, and demonstrate that our proposed framework helps these models generate temporally smooth animation clips while preserving the domain and diversity of their outputs. Code and pre-trained weights will be publicly available at https://animatediff.github.io/ .

READ FULL TEXT

page 1

page 6

page 7

page 8

page 9

page 11

page 12

page 13

research
06/29/2023

Generate Anything Anywhere in Any Scene

Text-to-image diffusion models have attracted considerable interest due ...
research
04/18/2023

Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models

Latent Diffusion Models (LDMs) enable high-quality image synthesis while...
research
05/25/2023

Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models

Text-to-image (T2I) research has grown explosively in the past year, owi...
research
03/22/2023

Pix2Video: Video Editing using Image Diffusion

Image diffusion models, trained on massive image collections, have emerg...
research
08/21/2023

Backdooring Textual Inversion for Concept Censorship

Recent years have witnessed success in AIGC (AI Generated Content). Peop...
research
06/01/2023

ViCo: Detail-Preserving Visual Condition for Personalized Text-to-Image Generation

Personalized text-to-image generation using diffusion models has recentl...
research
03/27/2023

Anti-DreamBooth: Protecting users from personalized text-to-image synthesis

Text-to-image diffusion models are nothing but a revolution, allowing an...

Please sign up or login with your details

Forgot password? Click here to reset