PARASOL: Parametric Style Control for Diffusion Image Synthesis

03/11/2023
by Gemma Canet Tarres, et al.

We propose PARASOL, a multi-modal synthesis model that enables disentangled, parametric control of an image's visual style by jointly conditioning synthesis on both content and a fine-grained visual style embedding. We train a latent diffusion model (LDM) with modality-specific losses and adapt classifier-free guidance to encourage disentangled control over the independent content and style modalities at inference time. We leverage auxiliary semantic and style-based search to create training triplets for supervising the LDM, ensuring that content and style cues are complementary. PARASOL shows promise for nuanced control over visual style in diffusion models for image creation and stylization, as well as for generative search, where text-based search results may be adapted to match user intent more closely by interpolating both content and style descriptors.
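To make the two inference-time ideas in the abstract concrete, here is a minimal sketch of classifier-free guidance with separate content and style weights, plus linear interpolation between two descriptors. It is not taken from the paper: the function names (`dual_guidance`, `lerp`), the argument names, and the particular way the three noise predictions are combined are assumptions chosen for illustration, and plain Python lists stand in for the model's tensors.

```python
from typing import List, Sequence


def dual_guidance(
    eps_uncond: Sequence[float],
    eps_content: Sequence[float],
    eps_both: Sequence[float],
    g_content: float,
    g_style: float,
) -> List[float]:
    """Combine three noise predictions with separate guidance weights.

    Assumed (hypothetical) formulation, not the paper's exact one:
        eps = eps_uncond
              + g_content * (eps_content - eps_uncond)
              + g_style   * (eps_both    - eps_content)
    where eps_content is conditioned on content only and eps_both on
    content and style together.
    """
    if not (len(eps_uncond) == len(eps_content) == len(eps_both)):
        raise ValueError("all noise predictions must have the same length")
    return [
        u + g_content * (c - u) + g_style * (b - c)
        for u, c, b in zip(eps_uncond, eps_content, eps_both)
    ]


def lerp(a: Sequence[float], b: Sequence[float], t: float) -> List[float]:
    """Linearly interpolate between two descriptors; t=0 gives a, t=1 gives b."""
    if len(a) != len(b):
        raise ValueError("descriptors must have the same length")
    return [(1.0 - t) * x + t * y for x, y in zip(a, b)]


if __name__ == "__main__":
    # Toy two-dimensional "noise predictions" for illustration only.
    uncond, content, both = [0.0, 2.0], [1.0, 4.0], [3.0, 8.0]
    # Stronger content weight, weaker style weight.
    print(dual_guidance(uncond, content, both, g_content=2.0, g_style=0.5))
    # Halfway blend of two toy style descriptors.
    print(lerp([0.0, 2.0], [4.0, 6.0], 0.5))
```

With both weights set to 1 the combination reduces to the fully conditioned prediction, and with both set to 0 it reduces to the unconditional one, so under this assumed formulation each weight scales the influence of one modality independently of the other.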

research
03/17/2022

CoGS: Controllable Generation and Search from Sketch and Style

We present CoGS, a novel method for the style-conditioned, sketch-driven...
research
10/12/2021

Fine-grained style control in Transformer-based Text-to-speech Synthesis

In this paper, we present a novel architecture to realize fine-grained s...
research
04/24/2023

Hierarchical Diffusion Autoencoders and Disentangled Image Manipulation

Diffusion models have attained impressive visual quality for image synth...
research
03/13/2023

Erasing Concepts from Diffusion Models

Motivated by recent advancements in text-to-image diffusion, we study er...
research
04/20/2023

Not Only Generative Art: Stable Diffusion for Content-Style Disentanglement in Art Analysis

The duality of content and style is inherent to the nature of art. For h...
research
07/26/2022

Text-Guided Synthesis of Artistic Images with Retrieval-Augmented Diffusion Models

Novel architectures have recently improved generative image synthesis le...
research
01/28/2023

SEGA: Instructing Diffusion using Semantic Dimensions

Text-to-image diffusion models have recently received a lot of interest ...