Unified Generative Adversarial Networks for Controllable Image-to-Image Translation

12/12/2019
by   Hao Tang, et al.
32

Controllable image-to-image translation, i.e., transferring an image from a source domain to a target one guided by controllable structures, has attracted much attention in both academia and industry. In this paper, we propose a unified Generative Adversarial Network (GAN) framework for controllable image-to-image translation. In addition to conditioning on a reference image, we show how the model can generate images conditioned on controllable structures, e.g., class labels, object keypoints, human skeletons and scene semantic maps. The proposed GAN framework consists of a single generator and a discriminator taking a conditional image and the target controllable structure as input. In this way, the conditional image can provide appearance information and the controllable structure can provide the structure information for generating the target result. Moreover, the proposed GAN learns the image-to-image mapping through three novel losses, i.e., color loss, controllable structure-guided cycle-consistency loss and controllable structure-guided self-identity preserving loss. Note that the proposed color loss handles the issue of "channel pollution" when back-propagating the gradients. In addition, we present the Fréchet ResNet Distance (FRD) to evaluate the quality of generated images. Extensive qualitative and quantitative experiments on two challenging image translation tasks with four different datasets demonstrate that the proposed GAN model generates convincing results, and significantly outperforms other state-of-the-art methods on both tasks. Meanwhile, the proposed GAN framework is a unified solution, thus it can be applied to solving other controllable structure-guided image-to-image translation tasks, such as landmark-guided facial expression translation and keypoint-guided person image generation.

READ FULL TEXT

page 1

page 4

page 5

page 7

page 8

page 9

page 11

research
08/19/2019

SPA-GAN: Spatial Attention GAN for Image-to-Image Translation

Image-to-image translation is to learn a mapping between images from a s...
research
02/10/2023

MaskSketch: Unpaired Structure-guided Masked Image Generation

Recent conditional image generation methods produce images of remarkable...
research
08/05/2022

Memory-Guided Collaborative Attention for Nighttime Thermal Infrared Image Colorization

Nighttime thermal infrared (NTIR) image colorization, also known as tran...
research
04/14/2020

Line Art Correlation Matching Network for Automatic Animation Colorization

Automatic animation line art colorization is a challenging computer visi...
research
12/25/2019

Asymmetric GAN for Unpaired Image-to-image Translation

Unpaired image-to-image translation problem aims to model the mapping fr...
research
10/11/2019

Illegible Text to Readable Text: An Image-to-Image Transformation using Conditional Sliced Wasserstein Adversarial Networks

Automatic text recognition from ancient handwritten record images is an ...
research
12/04/2017

Face Translation between Images and Videos using Identity-aware CycleGAN

This paper presents a new problem of unpaired face translation between i...

Please sign up or login with your details

Forgot password? Click here to reset