I2T2I: Learning Text to Image Synthesis with Textual Data Augmentation

03/20/2017
by   Hao Dong, et al.
0

Translating information between text and image is a fundamental problem in artificial intelligence that connects natural language processing and computer vision. In the past few years, performance in image caption generation has seen significant improvement through the adoption of recurrent neural networks (RNN). Meanwhile, text-to-image generation begun to generate plausible images using datasets of specific categories like birds and flowers. We've even seen image generation from multi-category datasets such as the Microsoft Common Objects in Context (MSCOCO) through the use of generative adversarial networks (GANs). Synthesizing objects with a complex shape, however, is still challenging. For example, animals and humans have many degrees of freedom, which means that they can take on many complex shapes. We propose a new training method called Image-Text-Image (I2T2I) which integrates text-to-image and image-to-text (image captioning) synthesis to improve the performance of text-to-image synthesis. We demonstrate that understand the sentence descriptions, so as to I2T2I can generate better multi-categories images using MSCOCO than the state-of-the-art. We also demonstrate that I2T2I can achieve transfer learning by using a pre-trained image captioning module to generate human images on the MPII Human Pose

READ FULL TEXT

page 1

page 4

research
11/29/2018

Shape-conditioned Image Generation by Learning Latent Appearance Representation from Unpaired Data

Conditional image generation is effective for diverse tasks including tr...
research
12/02/2020

A Framework and Dataset for Abstract Art Generation via CalligraphyGAN

With the advancement of deep learning, artificial intelligence (AI) has ...
research
03/30/2018

Guide Me: Interacting with Deep Networks

Interaction and collaboration between humans and intelligent machines ha...
research
08/30/2019

Systematic Analysis of Image Generation using GANs

Generative Adversarial Networks have been crucial in the developments ma...
research
12/18/2017

Synthesizing Novel Pairs of Image and Text

Generating novel pairs of image and text is a problem that combines comp...
research
05/09/2019

The Art of Food: Meal Image Synthesis from Ingredients

In this work we propose a new computational framework, based on generati...
research
04/23/2020

Efficient Neural Architecture for Text-to-Image Synthesis

Text-to-image synthesis is the task of generating images from text descr...

Please sign up or login with your details

Forgot password? Click here to reset