ManiGAN: Text-Guided Image Manipulation

12/12/2019
by   Bowen Li, et al.
9

The goal of our paper is to semantically edit parts of an image to match a given text that describes desired attributes (e.g., texture, colour, and background), while preserving other contents that are irrelevant to the text. To achieve this, we propose a novel generative adversarial network (ManiGAN), which contains two key components: text-image affine combination module (ACM) and detail correction module (DCM). The ACM selects image regions relevant to the given text and then correlates the regions with corresponding semantic words for effective manipulation. Meanwhile, it encodes original image features to help reconstruct text-irrelevant contents. The DCM rectifies mismatched attributes and completes missing contents of the synthetic image. Finally, we suggest a new metric for evaluating image manipulation results, in terms of both the generation of new attributes and the reconstruction of text-irrelevant contents. Extensive experiments on the CUB and COCO datasets demonstrate the superior performance of the proposed method.

READ FULL TEXT

page 2

page 3

page 6

page 7

page 8

research
10/29/2018

Text-Adaptive Generative Adversarial Networks: Manipulating Images with Natural Language

This paper addresses the problem of manipulating images using natural la...
research
04/09/2022

ManiTrans: Entity-Level Text-Guided Image Manipulation via Token-wise Semantic Alignment and Generation

Existing text-guided image manipulation methods aim to modify the appear...
research
11/25/2022

Interactive Image Manipulation with Complex Text Instructions

Recently, text-guided image manipulation has received increasing attenti...
research
02/22/2023

Entity-Level Text-Guided Image Manipulation

Existing text-guided image manipulation methods aim to modify the appear...
research
10/23/2020

Lightweight Generative Adversarial Networks for Text-Guided Image Manipulation

We propose a novel lightweight generative adversarial network for effici...
research
02/12/2020

Image-to-Image Translation with Text Guidance

The goal of this paper is to embed controllable factors, i.e., natural l...
research
09/25/2019

Focus Your Attention: A Bidirectional Focal Attention Network for Image-Text Matching

Learning semantic correspondence between image and text is significant a...

Please sign up or login with your details

Forgot password? Click here to reset