ANNA: Abstractive Text-to-Image Synthesis with Filtered News Captions

Advancements in Text-to-Image synthesis over recent years have focused more on improving the quality of generated samples on datasets with descriptive captions. However, real-world image-caption pairs present in domains such as news data do not use simple and directly descriptive captions. With captions containing information on both the image content and underlying contextual cues, they become abstractive in nature. In this paper, we launch ANNA, an Abstractive News captioNs dAtaset extracted from online news articles in a variety of different contexts. We explore the capabilities of current Text-to-Image synthesis models to generate news domain-specific images using abstractive captions by benchmarking them on ANNA, in both standard training and transfer learning settings. The generated images are judged on the basis of contextual relevance, visual quality, and perceptual similarity to ground-truth image-caption pairs. Through our experiments, we show that techniques such as transfer learning achieve limited success in understanding abstractive captions but still fail to consistently learn the relationships between content and context features.

READ FULL TEXT

page 1

page 4

page 6

page 7

page 8

research
07/29/2020

Enriching Video Captions With Contextual Text

Understanding video content and generating caption with context is an im...
research
04/18/2019

Knowledge-rich Image Gist Understanding Beyond Literal Meaning

We investigate the problem of understanding the message (gist) conveyed ...
research
03/23/2016

BreakingNews: Article Annotation by Image and Text Processing

Building upon recent Deep Neural Network architectures, current approach...
research
07/26/2022

NewsStories: Illustrating articles with visual summaries

Recent self-supervised approaches have used large-scale image-text datas...
research
09/20/2018

C4Synth: Cross-Caption Cycle-Consistent Text-to-Image Synthesis

Generating an image from its description is a challenging task worth sol...
research
07/25/2023

EmphasisChecker: A Tool for Guiding Chart and Caption Emphasis

Recent work has shown that when both the chart and caption emphasize the...
research
07/06/2021

Improving Text-to-Image Synthesis Using Contrastive Learning

The goal of text-to-image synthesis is to generate a visually realistic ...

Please sign up or login with your details

Forgot password? Click here to reset