Style-Content Disentanglement in Language-Image Pretraining Representations for Zero-Shot Sketch-to-Image Synthesis

06/03/2022
by   Jan Zuiderveld, et al.
0

In this work, we propose and validate a framework to leverage language-image pretraining representations for training-free zero-shot sketch-to-image synthesis. We show that disentangled content and style representations can be utilized to guide image generators to employ them as sketch-to-image generators without (re-)training any parameters. Our approach for disentangling style and content entails a simple method consisting of elementary arithmetic assuming compositionality of information in representations of input sketches. Our results demonstrate that this approach is competitive with state-of-the-art instance-level open-domain sketch-to-image models, while only depending on pretrained off-the-shelf models and a fraction of the data.

READ FULL TEXT

page 2

page 5

page 6

research
03/29/2023

Sketch-an-Anchor: Sub-epoch Fast Model Adaptation for Zero-shot Sketch-based Image Retrieval

Sketch-an-Anchor is a novel method to train state-of-the-art Zero-shot S...
research
08/12/2020

A Zero-Shot Sketch-based Inter-Modal Object Retrieval Scheme for Remote Sensing Images

Conventional existing retrieval methods in remote sensing (RS) are often...
research
03/08/2019

Semantically Tied Paired Cycle Consistency for Zero-Shot Sketch-based Image Retrieval

Zero-shot sketch-based image retrieval (SBIR) is an emerging task in com...
research
06/01/2023

Learning Disentangled Prompts for Compositional Image Synthesis

We study domain-adaptive image synthesis, the problem of teaching pretra...
research
12/08/2020

Learning Portrait Style Representations

Style analysis of artwork in computer vision predominantly focuses on ac...
research
03/24/2023

VILA: Learning Image Aesthetics from User Comments with Vision-Language Pretraining

Assessing the aesthetics of an image is challenging, as it is influenced...
research
07/07/2020

On Learning Semantic Representations for Million-Scale Free-Hand Sketches

In this paper, we study learning semantic representations for million-sc...

Please sign up or login with your details

Forgot password? Click here to reset