Multi-Channel Attention Selection GANs for Guided Image-to-Image Translation

02/03/2020
by   Hao Tang, et al.
10

We propose a novel model named Multi-Channel Attention Selection Generative Adversarial Network (SelectionGAN) for guided image-to-image translation, where we translate an input image into another while respecting an external semantic guidance. The proposed SelectionGAN explicitly utilizes the semantic guidance information and consists of two stages. In the first stage, the input image and the conditional semantic guidance are fed into a cycled semantic-guided generation network to produce initial coarse results. In the second stage, we refine the initial results by using the proposed multi-scale spatial pooling & channel selection module and the multi-channel attention selection module. Moreover, uncertainty maps automatically learned from attention maps are used to guide the pixel loss for better network optimization. Exhaustive experiments on four challenging guided image-to-image translation tasks (face, hand, body and street view) demonstrate that our SelectionGAN is able to generate significantly better results than the state-of-the-art methods. Meanwhile, the proposed framework and modules are unified solutions and can be applied to solve other generation tasks, such as semantic image synthesis. The code is available at https://github.com/Ha0Tang/SelectionGAN.

READ FULL TEXT

page 2

page 7

page 8

page 9

page 10

page 11

page 12

page 13

research
04/15/2019

Multi-Channel Attention Selection GAN with Cascaded Semantic Guidance for Cross-View Image Translation

Cross-view image translation is challenging because it involves images w...
research
10/24/2019

Guided Image-to-Image Translation with Bi-Directional Feature Transformation

We address the problem of guided image-to-image translation where we tra...
research
08/04/2021

Deep Portrait Lighting Enhancement with 3D Guidance

Despite recent breakthroughs in deep learning methods for image lighting...
research
10/19/2021

Cascaded Cross MLP-Mixer GANs for Cross-View Image Translation

It is hard to generate an image at target view well for previous cross-v...
research
08/05/2022

Memory-Guided Collaborative Attention for Nighttime Thermal Infrared Image Colorization

Nighttime thermal infrared (NTIR) image colorization, also known as tran...
research
06/21/2021

Total Generate: Cycle in Cycle Generative Adversarial Networks for Generating Human Faces, Hands, Bodies, and Natural Scenes

We propose a novel and unified Cycle in Cycle Generative Adversarial Net...
research
01/18/2019

Learning Spatial Pyramid Attentive Pooling in Image Synthesis and Image-to-Image Translation

Image synthesis and image-to-image translation are two important generat...

Please sign up or login with your details

Forgot password? Click here to reset