Image Manipulation with Natural Language using Two-sidedAttentive Conditional Generative Adversarial Network

12/16/2019
by   Dawei Zhu, et al.
13

Altering the content of an image with photo editing tools is a tedious task for an inexperienced user. Especially, when modifying the visual attributes of a specific object in an image without affecting other constituents such as background etc. To simplify the process of image manipulation and to provide more control to users, it is better to utilize a simpler interface like natural language. Therefore, in this paper, we address the challenge of manipulating images using natural language description. We propose the Two-sidEd Attentive conditional Generative Adversarial Network (TEA-cGAN) to generate semantically manipulated images while preserving other contents such as background intact. TEA-cGAN uses fine-grained attention both in the generator and discriminator of Generative Adversarial Network (GAN) based framework at different scales. Experimental results show that TEA-cGAN which generates 128x128 and 256x256 resolution images outperforms existing methods on CUB and Oxford-102 datasets both quantitatively and qualitatively.

READ FULL TEXT

page 2

page 3

page 9

page 10

page 11

page 12

page 16

page 17

research
10/29/2018

Text-Adaptive Generative Adversarial Networks: Manipulating Images with Natural Language

This paper addresses the problem of manipulating images using natural la...
research
03/18/2019

Bilinear Representation for Language-based Image Editing Using Conditional Generative Adversarial Networks

The task of Language-Based Image Editing (LBIE) aims at generating a tar...
research
09/12/2016

Generative Visual Manipulation on the Natural Image Manifold

Realistic image manipulation is challenging because it requires modifyin...
research
10/23/2020

Lightweight Generative Adversarial Networks for Text-Guided Image Manipulation

We propose a novel lightweight generative adversarial network for effici...
research
11/24/2018

Generate, Segment and Replace: Towards Generic Manipulation Segmentation

It has been witnessed an emerging demand for image manipulation segmenta...
research
04/20/2019

Everyone is a Cartoonist: Selfie Cartoonization with Attentive Adversarial Networks

Selfie and cartoon are two popular artistic forms that are widely presen...
research
08/12/2019

Deep Tone Mapping Operator for High Dynamic Range Images

A computationally fast tone mapping operator (TMO) that can quickly adap...

Please sign up or login with your details

Forgot password? Click here to reset