StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks

10/19/2017
by   Han Zhang, et al.

Although Generative Adversarial Networks (GANs) have shown remarkable success in various tasks, they still face challenges in generating high-quality images. In this paper, we propose Stacked Generative Adversarial Networks (StackGAN) aimed at generating high-resolution photo-realistic images. First, we propose a two-stage generative adversarial network architecture, StackGAN-v1, for text-to-image synthesis. The Stage-I GAN sketches the primitive shape and colors of the object based on a given text description, yielding low-resolution images. The Stage-II GAN takes the Stage-I results and the text description as inputs, and generates high-resolution images with photo-realistic details. Second, an advanced multi-stage generative adversarial network architecture, StackGAN-v2, is proposed for both conditional and unconditional generative tasks. Our StackGAN-v2 consists of multiple generators and discriminators arranged in a tree-like structure; images at multiple scales corresponding to the same scene are generated from different branches of the tree. StackGAN-v2 shows more stable training behavior than StackGAN-v1 by jointly approximating multiple distributions. Extensive experiments demonstrate that the proposed stacked generative adversarial networks significantly outperform other state-of-the-art methods in generating photo-realistic images.
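
The tree-like structure described for StackGAN-v2 can be illustrated with a toy sketch of the data flow only. It is not the authors' implementation: the function names (`upsample2x`, `refine`, `to_image`, `generate_multiscale`), the single-channel grids, the base size of 4, and the random scalar "weights" are all assumptions made for illustration, and no discriminators or training are shown. The point is simply that one shared hidden state is refined stage by stage, and each stage branches off an image at its own scale.

```python
import math
import random


def upsample2x(grid):
    """Nearest-neighbour 2x upsampling of a square 2-D grid of floats."""
    out = []
    for row in grid:
        wide = [v for v in row for _ in (0, 1)]  # duplicate each column
        out.append(wide)
        out.append(list(wide))                   # duplicate each row
    return out


def refine(grid, weight, bias):
    """Hypothetical stand-in for a learned block: affine map, then ReLU."""
    return [[max(weight * v + bias, 0.0) for v in row] for row in grid]


def to_image(grid):
    """Hypothetical stand-in for a branch's output layer: tanh squashing."""
    return [[math.tanh(v) for v in row] for row in grid]


def generate_multiscale(z, n_branches=3, base=4, seed=0):
    """Return one image per branch, each twice the size of the previous one."""
    rng = random.Random(seed)
    # Seed a base x base hidden grid from the noise vector z (cycling over z).
    hidden = [[z[(r * base + c) % len(z)] for c in range(base)]
              for r in range(base)]
    images = []
    for _ in range(n_branches):
        # The hidden state is shared along the trunk of the tree ...
        hidden = refine(upsample2x(hidden),
                        rng.uniform(0.5, 1.5), rng.uniform(-0.1, 0.1))
        # ... and each stage emits its own image as a separate branch.
        images.append(to_image(hidden))
    return images


if __name__ == "__main__":
    for image in generate_multiscale([0.3, -0.7, 1.2, 0.1]):
        print(len(image), "x", len(image[0]))
```

In a real model each branch's output would be scored by its own discriminator, which is what the abstract means by jointly approximating multiple distributions; the sketch above only shows how the images at different scales share one underlying hidden state.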


research
12/10/2016

StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks

Synthesizing high-quality images from text descriptions is a challenging...
research
03/27/2019

Auto-Embedding Generative Adversarial Networks for High Resolution Image Synthesis

Generating images via the generative adversarial network (GAN) has attra...
research
07/06/2022

Text to Image Synthesis using Stacked Conditional Variational Autoencoders and Conditional Generative Adversarial Networks

Synthesizing a realistic image from textual description is a major chall...
research
06/08/2018

Generating Image Sequence from Description with LSTM Conditional GAN

Generating images from word descriptions is a challenging task. Generati...
research
03/04/2021

Robustness Evaluation of Stacked Generative Adversarial Networks using Metamorphic Testing

Synthesising photo-realistic images from natural language is one of the ...
research
07/26/2019

VITAL: A Visual Interpretation on Text with Adversarial Learning for Image Labeling

In this paper, we propose a novel way to interpret text information by e...
research
11/07/2018

Forging new worlds: high-resolution synthetic galaxies with chained generative adversarial networks

Astronomy of the 21st century finds itself with extreme quantities of da...
