Design Booster: A Text-Guided Diffusion Model for Image Translation with Spatial Layout Preservation

02/05/2023
by   Shiqi Sun, et al.
0

Diffusion models are able to generate photorealistic images in arbitrary scenes. However, when applying diffusion models to image translation, there exists a trade-off between maintaining spatial structure and high-quality content. Besides, existing methods are mainly based on test-time optimization or fine-tuning model for each input image, which are extremely time-consuming for practical applications. To address these issues, we propose a new approach for flexible image translation by learning a layout-aware image condition together with a text condition. Specifically, our method co-encodes images and text into a new domain during the training phase. In the inference stage, we can choose images/text or both as the conditions for each time step, which gives users more flexible control over layout and content. Experimental comparisons of our method with state-of-the-art methods demonstrate our model performs best in both style image translation and semantic image translation and took the shortest time.

READ FULL TEXT

page 1

page 3

page 4

page 5

page 6

page 7

page 8

research
06/07/2023

Improving Diffusion-based Image Translation using Asymmetric Gradient Guidance

Diffusion models have shown significant progress in image translation ta...
research
11/22/2022

Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation

Large-scale text-to-image generative models have been a revolutionary br...
research
07/20/2023

BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion

Recent text-to-image diffusion models have demonstrated an astonishing c...
research
02/28/2023

Towards Enhanced Controllability of Diffusion Models

Denoising Diffusion models have shown remarkable capabilities in generat...
research
07/25/2023

Composite Diffusion | whole >= Σparts

For an artist or a graphic designer, the spatial layout of a scene is a ...
research
11/24/2022

Sketch-Guided Text-to-Image Diffusion Models

Text-to-Image models have introduced a remarkable leap in the evolution ...
research
12/25/2019

Controllable and Progressive Image Extrapolation

Image extrapolation aims at expanding the narrow field of view of a give...

Please sign up or login with your details

Forgot password? Click here to reset