Keep Drawing It: Iterative language-based image generation and editing

11/24/2018
by   Alaaeldin El-Nouby, et al.
0

Conditional text-to-image generation approaches commonly focus on generating a single image in a single step. One practical extension beyond one-step generation is an interactive system that generates an image iteratively, conditioned on ongoing linguistic input / feedback. This is significantly more challenging as such a system must understand and keep track of the ongoing context and history. In this work, we present a recurrent image generation model which takes into account both the generated output up to the current step as well as all past instructions for generation. We show that our model is able to generate the background, add new objects, apply simple transformations to existing objects, and correct previous mistakes. We believe our approach is an important step toward interactive generation.

READ FULL TEXT

page 4

page 6

page 7

page 8

page 9

research
06/16/2016

Conditional Image Generation with PixelCNN Decoders

This work explores conditional image generation with a new image density...
research
10/30/2020

MichiGAN: Multi-Input-Conditioned Hair Image Generation for Portrait Editing

Despite the recent success of face image generation with GANs, condition...
research
02/09/2021

Diverse Single Image Generation with Controllable Global Structure through Self-Attention

Image generation from a single image using generative adversarial networ...
research
07/18/2023

PromptCrafter: Crafting Text-to-Image Prompt through Mixed-Initiative Dialogue with LLM

Text-to-image generation model is able to generate images across a diver...
research
05/12/2020

Scones: Towards Conversational Authoring of Sketches

Iteratively refining and critiquing sketches are crucial steps to develo...
research
03/04/2021

MOGAN: Morphologic-structure-aware Generative Learning from a Single Image

In most interactive image generation tasks, given regions of interest (R...
research
02/17/2023

Grimm in Wonderland: Prompt Engineering with Midjourney to Illustrate Fairytales

The quality of text-to-image generation is continuously improving, yet t...

Please sign up or login with your details

Forgot password? Click here to reset