Remember What You have drawn: Semantic Image Manipulation with Memory

07/27/2021
by Xiangxi Shi, et al.

Image manipulation with natural language, which aims to manipulate images with the guidance of language descriptions, has been a challenging problem in the fields of computer vision and natural language processing (NLP). A number of efforts have been made on this task, but their performance is still far from generating realistic manipulated images that conform to the text. Therefore, in this paper, we propose a memory-based Image Manipulation Network (MIM-Net), in which a set of memories learned from images is introduced to synthesize texture information under the guidance of the textual description. We propose a two-stage network with an additional reconstruction stage to learn the latent memories efficiently. To avoid unnecessary background changes, we propose a Target Localization Unit (TLU) that focuses the manipulation on the region mentioned by the text. Moreover, to learn a robust memory, we further propose a novel randomized memory training loss. Experiments on four popular datasets show that our method outperforms existing ones.
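The abstract does not spell out how the memories are read or how the TLU restricts edits, so the following is only a rough illustration of the general idea, not the authors' implementation. It assumes dot-product attention over a small set of memory slots queried by a text embedding, and a per-location soft mask that blends edited features into the original ones. All names here (`read_memory`, `blend`, the toy vectors) are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def read_memory(query, memory):
    """Assumed attention read: weight each memory slot by its
    dot-product similarity to the query, then average the slots."""
    scores = [sum(q * m for q, m in zip(query, slot)) for slot in memory]
    weights = softmax(scores)
    dim = len(memory[0])
    retrieved = [
        sum(w * slot[d] for w, slot in zip(weights, memory))
        for d in range(dim)
    ]
    return retrieved, weights


def blend(original, edited, mask):
    """Assumed localization step: mask value 1 takes the edited feature,
    mask value 0 keeps the original (background) feature."""
    return [
        [m * e + (1.0 - m) * o for o, e in zip(orig_vec, edit_vec)]
        for orig_vec, edit_vec, m in zip(original, edited, mask)
    ]


# Toy data: two memory slots, two image locations, 2-d features.
memory = [[1.0, 0.0], [0.0, 1.0]]
text_query = [10.0, 0.0]            # stands in for a text embedding
retrieved, weights = read_memory(text_query, memory)

original = [[0.2, 0.2], [0.5, 0.5]]
edited = [retrieved, retrieved]     # same retrieved texture at each location
mask = [1.0, 0.0]                   # only the first location is the target
output = blend(original, edited, mask)
```

In this toy setup the second location has mask 0, so its features pass through unchanged, which mirrors the stated goal of avoiding unnecessary background changes.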

