ViCo: Detail-Preserving Visual Condition for Personalized Text-to-Image Generation

06/01/2023
by   Shaozhe Hao, et al.
0

Personalized text-to-image generation using diffusion models has recently been proposed and attracted lots of attention. Given a handful of images containing a novel concept (e.g., a unique toy), we aim to tune the generative model to capture fine visual details of the novel concept and generate photorealistic images following a text condition. We present a plug-in method, named ViCo, for fast and lightweight personalized generation. Specifically, we propose an image attention module to condition the diffusion process on the patch-wise visual semantics. We introduce an attention-based object mask that comes almost at no cost from the attention module. In addition, we design a simple regularization based on the intrinsic properties of text-image attention maps to alleviate the common overfitting degradation. Unlike many existing models, our method does not finetune any parameters of the original diffusion model. This allows more flexible and transferable model deployment. With only light parameter training ( 6 comparable or even better performance than all state-of-the-art models both qualitatively and quantitatively.

READ FULL TEXT

page 8

page 9

page 14

page 16

page 19

page 20

page 21

page 22

research
05/18/2023

Discriminative Diffusion Models as Few-shot Vision and Language Learners

Diffusion models, such as Stable Diffusion, have shown incredible perfor...
research
05/30/2023

Nested Diffusion Processes for Anytime Image Generation

Diffusion models are the current state-of-the-art in image generation, s...
research
07/19/2023

FABRIC: Personalizing Diffusion Models with Iterative Feedback

In an era where visual content generation is increasingly driven by mach...
research
09/08/2023

Create Your World: Lifelong Text-to-Image Diffusion

Text-to-image generative models can produce diverse high-quality images ...
research
09/08/2023

From Text to Mask: Localizing Entities Using the Attention of Text-to-Image Diffusion Models

Diffusion models have revolted the field of text-to-image generation rec...
research
03/26/2020

Cycle Text-To-Image GAN with BERT

We explore novel approaches to the task of image generation from their r...
research
07/10/2023

AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning

With the advance of text-to-image models (e.g., Stable Diffusion) and co...

Please sign up or login with your details

Forgot password? Click here to reset