Generate Anything Anywhere in Any Scene

06/29/2023
by Yuheng Li, et al.

Text-to-image diffusion models have attracted considerable interest due to their wide applicability across diverse fields. However, challenges persist in creating controllable models for personalized object generation. In this paper, we first identify the entanglement issues in existing personalized generative models, and then propose a straightforward and efficient data augmentation training strategy that guides the diffusion model to focus solely on object identity. By inserting the plug-and-play adapter layers from a pre-trained controllable diffusion model, our model obtains the ability to control the location and size of each generated personalized object. During inference, we propose a regionally-guided sampling technique to maintain the quality and fidelity of the generated images. Our method achieves comparable or superior fidelity for personalized objects, yielding a robust, versatile, and controllable text-to-image diffusion model that is capable of generating realistic and personalized images. Our approach demonstrates significant potential for various applications, such as those in art, entertainment, and advertising design.


