Paint by Word

03/19/2021
by   David Bau, et al.
6

We investigate the problem of zero-shot semantic image painting. Instead of painting modifications into an image using only concrete colors or a finite set of semantic concepts, we ask how to create semantic paint based on open full-text descriptions: our goal is to be able to point to a location in a synthesized image and apply an arbitrary new concept such as "rustic" or "opulent" or "happy dog." To do this, our method combines a state-of-the art generative model of realistic images with a state-of-the-art text-image semantic similarity network. We find that, to make large changes, it is important to use non-gradient methods to explore latent space, and it is important to relax the computations of the GAN to target changes to a specific region. We conduct user studies to compare our methods to several baselines.

READ FULL TEXT

page 1

page 3

page 5

page 6

page 7

page 8

research
12/09/2019

Bi-Semantic Reconstructing Generative Network for Zero-shot Learning

Many recent methods of zero-shot learning (ZSL) attempt to utilize gener...
research
08/21/2023

Improving Diversity in Zero-Shot GAN Adaptation with Semantic Variations

Training deep generative models usually requires a large amount of data....
research
08/08/2022

Txt2Img-MHN: Remote Sensing Image Generation from Text Using Modern Hopfield Networks

The synthesis of high-resolution remote sensing images based on text des...
research
03/09/2022

FlexIT: Towards Flexible Semantic Image Translation

Deep generative models, like GANs, have considerably improved the state ...
research
08/02/2021

StyleGAN-NADA: CLIP-Guided Domain Adaptation of Image Generators

Can a generative model be trained to produce images from a specific doma...
research
02/02/2021

Generating images from caption and vice versa via CLIP-Guided Generative Latent Space Search

In this research work we present CLIP-GLaSS, a novel zero-shot framework...
research
01/25/2022

Rayleigh EigenDirections (REDs): GAN latent space traversals for multidimensional features

We present a method for finding paths in a deep generative model's laten...

Please sign up or login with your details

Forgot password? Click here to reset