C4Synth: Cross-Caption Cycle-Consistent Text-to-Image Synthesis

09/20/2018
by   K J Joseph, et al.
0

Generating an image from its description is a challenging task worth solving because of its numerous practical applications ranging from image editing to virtual reality. All existing methods use one single caption to generate a plausible image. A single caption by itself, can be limited, and may not be able to capture the variety of concepts and behavior that may be present in the image. We propose two deep generative models that generate an image by making use of multiple captions describing it. This is achieved by ensuring 'Cross-Caption Cycle Consistency' between the multiple captions and the generated image(s). We report quantitative and qualitative results on the standard Caltech-UCSD Birds (CUB) and Oxford-102 Flowers datasets to validate the efficacy of the proposed approach.

READ FULL TEXT

page 1

page 7

page 8

research
03/25/2019

End-to-End Learning Using Cycle Consistency for Image-to-Caption Transformations

So far, research to generate captions from images has been carried out f...
research
01/05/2023

ANNA: Abstractive Text-to-Image Synthesis with Filtered News Captions

Advancements in Text-to-Image synthesis over recent years have focused m...
research
08/14/2018

Text-to-Image-to-Text Translation using Cycle Consistent Adversarial Networks

Text-to-Image translation has been an active area of research in the rec...
research
07/06/2021

Improving Text-to-Image Synthesis Using Contrastive Learning

The goal of text-to-image synthesis is to generate a visually realistic ...
research
10/09/2019

Text-to-Image Synthesis Based on Machine Generated Captions

Text to Image Synthesis refers to the process of automatic generation of...
research
08/21/2019

Improving Captioning for Low-Resource Languages by Cycle Consistency

Improving the captioning performance on low-resource languages by levera...
research
04/22/2021

METGAN: Generative Tumour Inpainting and Modality Synthesis in Light Sheet Microscopy

Novel multimodal imaging methods are capable of generating extensive, su...

Please sign up or login with your details

Forgot password? Click here to reset