SEGA: Instructing Diffusion using Semantic Dimensions

01/28/2023
by   Manuel Brack, et al.
8

Text-to-image diffusion models have recently received a lot of interest for their astonishing ability to produce high-fidelity images from text only. However, achieving one-shot generation that aligns with the user's intent is nearly impossible, yet small changes to the input prompt often result in very different images. This leaves the user with little semantic control. To put the user in control, we show how to interact with the diffusion process to flexibly steer it along semantic directions. This semantic guidance (SEGA) allows for subtle and extensive edits, changes in composition and style, as well as optimizing the overall artistic conception. We demonstrate SEGA's effectiveness on a variety of tasks and provide evidence for its versatility and flexibility.

READ FULL TEXT

page 1

page 3

page 5

page 6

page 8

page 12

page 13

page 14

research
12/12/2022

The Stable Artist: Steering Semantics in Diffusion Latent Space

Large, text-conditioned generative diffusion models have recently gained...
research
02/05/2023

Mixture of Diffusers for scene composition and high resolution image generation

Diffusion methods have been proven to be very effective to generate imag...
research
06/30/2023

Counting Guidance for High Fidelity Text-to-Image Synthesis

Recently, the quality and performance of text-to-image generation signif...
research
07/03/2023

DifFSS: Diffusion Model for Few-Shot Semantic Segmentation

Diffusion models have demonstrated excellent performance in image genera...
research
03/11/2023

PARASOL: Parametric Style Control for Diffusion Image Synthesis

We propose PARASOL, a multi-modal synthesis model that enables disentang...
research
11/30/2022

High-Fidelity Guided Image Synthesis with Latent Diffusion Models

Controllable image synthesis with user scribbles has gained huge public ...
research
10/28/2022

MagicMix: Semantic Mixing with Diffusion Models

Have you ever imagined what a corgi-alike coffee machine or a tiger-alik...

Please sign up or login with your details

Forgot password? Click here to reset