Learning to Globally Edit Images with Textual Description

10/13/2018
by Hai Wang, et al.

We show how to globally edit images using textual instructions: given a source image and a textual instruction for the edit, the task is to generate a new image transformed according to that instruction. To tackle this novel problem, we develop three trainable models based on recurrent neural networks (RNNs) and generative adversarial networks (GANs). The models (bucket, filter bank, and end-to-end) differ in how much expert knowledge they encode, with the most general version being purely end-to-end. To train these systems, we use Amazon Mechanical Turk to collect textual descriptions for around 2000 image pairs sampled from several datasets. Experimental results on our dataset validate our approaches. In addition, because the filter bank model is a good compromise between generality and performance, we investigate it further by replacing the RNN with a Graph RNN, and show that the Graph RNN improves performance. To the best of our knowledge, this is the first computational photography work on global image editing that is purely based on free-form textual instructions.

research
03/22/2023

Instruct-NeRF2NeRF: Editing 3D Scenes with Instructions

We propose a method for editing NeRF scenes with text-instructions. Give...
research
06/12/2023

InstructP2P: Learning to Edit 3D Point Clouds with Text Instructions

Enhancing AI systems to perform tasks following human instructions can s...
research
08/15/2020

Graph Edit Distance Reward: Learning to Edit Scene Graph

Scene Graph, as a vital tool to bridge the gap between language domain a...
research
09/21/2020

SSCR: Iterative Language-Based Image Editing via Self-Supervised Counterfactual Reasoning

Iterative Language-Based Image Editing (IL-BIE) tasks follow iterative i...
research
02/11/2020

Adjusting Image Attributes of Localized Regions with Low-level Dialogue

Natural Language Image Editing (NLIE) aims to use natural language instr...
research
08/07/2020

Textual Description for Mathematical Equations

Reading of mathematical expression or equation in the document images is...
research
11/16/2017

Language-Based Image Editing with Recurrent Attentive Models

We investigate the problem of Language-Based Image Editing (LBIE) in thi...