Guided Image Synthesis via Initial Image Editing in Diffusion Model

05/05/2023
by   Jiafeng Mao, et al.
0

Diffusion models have the ability to generate high quality images by denoising pure Gaussian noise images. While previous research has primarily focused on improving the control of image generation through adjusting the denoising process, we propose a novel direction of manipulating the initial noise to control the generated image. Through experiments on stable diffusion, we show that blocks of pixels in the initial latent images have a preference for generating specific content, and that modifying these blocks can significantly influence the generated image. In particular, we show that modifying a part of the initial image affects the corresponding region of the generated image while leaving other regions unaffected, which is useful for repainting tasks. Furthermore, we find that the generation preferences of pixel blocks are primarily determined by their values, rather than their position. By moving pixel blocks with a tendency to generate user-desired content to user-specified regions, our approach achieves state-of-the-art performance in layout-to-image generation. Our results highlight the flexibility and power of initial image manipulation in controlling the generated image.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

page 6

page 8

research
08/06/2021

ILVR: Conditioning Method for Denoising Diffusion Probabilistic Models

Denoising diffusion probabilistic models (DDPM) have shown remarkable pe...
research
05/30/2023

Nested Diffusion Processes for Anytime Image Generation

Diffusion models are the current state-of-the-art in image generation, s...
research
12/12/2022

The Stable Artist: Steering Semantics in Diffusion Latent Space

Large, text-conditioned generative diffusion models have recently gained...
research
08/26/2022

Adaptively-Realistic Image Generation from Stroke and Sketch with Diffusion Model

Generating images from hand-drawings is a crucial and fundamental task i...
research
11/25/2022

SpaText: Spatio-Textual Representation for Controllable Image Generation

Recent text-to-image diffusion models are able to generate convincing re...
research
09/06/2023

My Art My Choice: Adversarial Protection Against Unruly AI

Generative AI is on the rise, enabling everyone to produce realistic con...
research
07/11/2023

TIAM – A Metric for Evaluating Alignment in Text-to-Image Generation

The progress in the generation of synthetic images has made it crucial t...

Please sign up or login with your details

Forgot password? Click here to reset