Text as Neural Operator: Image Manipulation by Text Instruction

08/11/2020
by   Tianhao Zhang, et al.
8

In this paper, we study a new task that allows users to edit an input image using language instructions. In this image generation task, the inputs are a reference image and a text instruction that describes desired modifications to the input image. We propose a GAN-based method to tackle this problem. The key idea is to treat language as neural operators to locally modify the image feature. To this end, our model decomposes the generation process into finding where (spatial region) and how (text operators) to apply modifications. We show that the proposed model performs favorably against recent baselines on three datasets.

READ FULL TEXT

page 2

page 3

page 7

page 8

research
11/26/2022

Target-Free Text-guided Image Manipulation

We tackle the problem of target-free text-guided image manipulation, whi...
research
02/23/2018

Interactive Image Manipulation with Natural Language Instruction Commands

We propose an interactive image-manipulation system with natural languag...
research
02/07/2019

Neural Inverse Knitting: From Images to Manufacturing Instructions

Motivated by the recent potential of mass customization brought by whole...
research
12/18/2018

Composing Text and Image for Image Retrieval - An Empirical Odyssey

In this paper, we study the task of image retrieval, where the input que...
research
08/17/2023

Watch Your Steps: Local Image and Scene Editing by Text Instructions

Denoising diffusion models have enabled high-quality image generation an...
research
10/17/2022

UniTune: Text-Driven Image Editing by Fine Tuning an Image Generation Model on a Single Image

We present UniTune, a simple and novel method for general text-driven im...
research
08/01/2022

Exploring the GLIDE model for Human Action-effect Prediction

We address the following action-effect prediction task. Given an image d...

Please sign up or login with your details

Forgot password? Click here to reset