DSE-GAN: Dynamic Semantic Evolution Generative Adversarial Network for Text-to-Image Generation

09/03/2022
by   Mengqi Huang, et al.
0

Text-to-image generation aims at generating realistic images which are semantically consistent with the given text. Previous works mainly adopt the multi-stage architecture by stacking generator-discriminator pairs to engage multiple adversarial training, where the text semantics used to provide generation guidance remain static across all stages. This work argues that text features at each stage should be adaptively re-composed conditioned on the status of the historical stage (i.e., historical stage's text and image features) to provide diversified and accurate semantic guidance during the coarse-to-fine generation process. We thereby propose a novel Dynamical Semantic Evolution GAN (DSE-GAN) to re-compose each stage's text features under a novel single adversarial multi-stage architecture. Specifically, we design (1) Dynamic Semantic Evolution (DSE) module, which first aggregates historical image features to summarize the generative feedback, and then dynamically selects words required to be re-composed at each stage as well as re-composed them by dynamically enhancing or suppressing different granularity subspace's semantics. (2) Single Adversarial Multi-stage Architecture (SAMA), which extends the previous structure by eliminating complicated multiple adversarial training requirements and therefore allows more stages of text-image interactions, and finally facilitates the DSE module. We conduct comprehensive experiments and show that DSE-GAN achieves 7.48% and 37.8% relative FID improvement on two widely used benchmarks, i.e., CUB-200 and MSCOCO, respectively.

READ FULL TEXT

page 7

page 8

research
11/28/2017

AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks

In this paper, we propose an Attentional Generative Adversarial Network ...
research
04/01/2021

Text to Image Generation with Semantic-Spatial Aware GAN

A text to image generation (T2I) model aims to generate photo-realistic ...
research
04/17/2022

DR-GAN: Distribution Regularization for Text-to-Image Generation

This paper presents a new Text-to-Image generation model, named Distribu...
research
04/13/2023

ALR-GAN: Adaptive Layout Refinement for Text-to-Image Synthesis

We propose a novel Text-to-Image Generation Network, Adaptive Layout Ref...
research
08/19/2023

EGANS: Evolutionary Generative Adversarial Network Search for Zero-Shot Learning

Zero-shot learning (ZSL) aims to recognize the novel classes which canno...
research
07/02/2020

PerceptionGAN: Real-world Image Construction from Provided Text through Perceptual Understanding

Generating an image from a provided descriptive text is quite a challeng...
research
05/18/2019

Variational Hetero-Encoder Randomized Generative Adversarial Networks for Joint Image-Text Modeling

For bidirectional joint image-text modeling, we develop variational hete...

Please sign up or login with your details

Forgot password? Click here to reset