Efficient Neural Architecture for Text-to-Image Synthesis

04/23/2020
by   Douglas M. Souza, et al.
4

Text-to-image synthesis is the task of generating images from text descriptions. Image generation, by itself, is a challenging task. When we combine image generation and text, we bring complexity to a new level: we need to combine data from two different modalities. Most of recent works in text-to-image synthesis follow a similar approach when it comes to neural architectures. Due to aforementioned difficulties, plus the inherent difficulty of training GANs at high resolutions, most methods have adopted a multi-stage training strategy. In this paper we shift the architectural paradigm currently used in text-to-image methods and show that an effective neural architecture can achieve state-of-the-art performance using a single stage training with a single generator and a single discriminator. We do so by applying deep residual networks along with a novel sentence interpolation strategy that enables learning a smooth conditional space. Finally, our work points a new direction for text-to-image research, which has not experimented with novel neural architectures recently.

READ FULL TEXT

page 1

page 4

page 6

page 7

research
04/05/2022

DT2I: Dense Text-to-Image Generation from Region Descriptions

Despite astonishing progress, generating realistic images of complex sce...
research
11/30/2021

EAGAN: Efficient Two-stage Evolutionary Architecture Search for GANs

Generative Adversarial Networks (GANs) have been proven hugely successfu...
research
12/05/2019

MetalGAN: Multi-Domain Label-Less Image Synthesis Using cGANs and Meta-Learning

Image synthesis is currently one of the most addressed image processing ...
research
09/28/2022

Adma-GAN: Attribute-Driven Memory Augmented GANs for Text-to-Image Generation

As a challenging task, text-to-image generation aims to generate photo-r...
research
03/20/2017

I2T2I: Learning Text to Image Synthesis with Textual Data Augmentation

Translating information between text and image is a fundamental problem ...
research
10/15/2021

Multi-Tailed, Multi-Headed, Spatial Dynamic Memory refined Text-to-Image Synthesis

Synthesizing high-quality, realistic images from text-descriptions is a ...
research
02/26/2018

Photographic Text-to-Image Synthesis with a Hierarchically-nested Adversarial Network

This paper presents a novel method to deal with the challenging task of ...

Please sign up or login with your details

Forgot password? Click here to reset