Layout-to-Image Translation with Double Pooling Generative Adversarial Networks

08/29/2021
by   Hao Tang, et al.
1

In this paper, we address the task of layout-to-image translation, which aims to translate an input semantic layout to a realistic image. One open challenge widely observed in existing methods is the lack of effective semantic constraints during the image translation process, leading to models that cannot preserve the semantic information and ignore the semantic dependencies within the same object. To address this issue, we propose a novel Double Pooing GAN (DPGAN) for generating photo-realistic and semantically-consistent results from the input layout. We also propose a novel Double Pooling Module (DPM), which consists of the Square-shape Pooling Module (SPM) and the Rectangle-shape Pooling Module (RPM). Specifically, SPM aims to capture short-range semantic dependencies of the input layout with different spatial scales, while RPM aims to capture long-range semantic dependencies from both horizontal and vertical directions. We then effectively fuse both outputs of SPM and RPM to further enlarge the receptive field of our generator. Extensive experiments on five popular datasets show that the proposed DPGAN achieves better results than state-of-the-art methods. Finally, both SPM and SPM are general and can be seamlessly integrated into any GAN-based architectures to strengthen the feature representation. The code is available at https://github.com/Ha0Tang/DPGAN.

READ FULL TEXT

page 1

page 2

page 3

page 5

page 6

page 7

page 8

page 10

research
08/29/2020

Dual Attention GANs for Semantic Image Synthesis

In this paper, we focus on the semantic image synthesis task that aims a...
research
03/31/2020

Edge Guided GANs with Semantic Preserving for Semantic Image Synthesis

We propose a novel Edge guided Generative Adversarial Network (EdgeGAN) ...
research
03/30/2020

Strip Pooling: Rethinking Spatial Pooling for Scene Parsing

Spatial pooling has been proven highly effective in capturing long-range...
research
04/15/2019

Multi-Channel Attention Selection GAN with Cascaded Semantic Guidance for Cross-View Image Translation

Cross-view image translation is challenging because it involves images w...
research
07/22/2023

Edge Guided GANs with Multi-Scale Contrastive Learning for Semantic Image Synthesis

We propose a novel ECGAN for the challenging semantic image synthesis ta...
research
09/18/2023

LayoutNUWA: Revealing the Hidden Layout Expertise of Large Language Models

Graphic layout generation, a growing research field, plays a significant...
research
07/06/2021

Independent Encoder for Deep Hierarchical Unsupervised Image-to-Image Translation

The main challenges of image-to-image (I2I) translation are to make the ...

Please sign up or login with your details

Forgot password? Click here to reset