VisualNews : Benchmark and Challenges in Entity-aware Image Captioning

10/08/2020
by   Fuxiao Liu, et al.
22

In this paper we propose VisualNews-Captioner, an entity-aware model for the task of news image captioning. We also introduce VisualNews, a large-scale benchmark consisting of more than one million news images along with associated news articles, image captions, author information, and other metadata. Unlike the standard image captioning task, news images depict situations where people, locations, and events are of paramount importance. Our proposed method is able to effectively combine visual and textual features to generate captions with richer information such as events and entities. More specifically, we propose an Entity-Aware module along with an Entity-Guide attention layer to encourage more accurate predictions for named entities. Our method achieves state-of-the-art results on both the GoodNews and VisualNews datasets while having significantly fewer parameters than competing methods. Our larger and more diverse VisualNews dataset further highlights the remaining challenges in captioning news images.

READ FULL TEXT

page 1

page 7

research
09/07/2021

Journalistic Guidelines Aware News Image Captioning

The task of news article image captioning aims to generate descriptive a...
research
08/04/2021

ICECAP: Information Concentrated Entity-aware Image Captioning

Most current image captioning systems focus on describing general image ...
research
04/21/2018

Entity-aware Image Caption Generation

Image captioning approaches currently generate descriptions which lack s...
research
02/04/2023

Transform, Contrast and Tell: Coherent Entity-Aware Multi-Image Captioning

Coherent entity-aware multi-image captioning aims to generate coherent c...
research
12/11/2021

Show and Write: Entity-aware News Generation with Image Information

Automatically writing long articles is a complex and challenging languag...
research
04/02/2019

Good News, Everyone! Context driven entity-aware captioning for news images

Current image captioning systems perform at a merely descriptive level, ...
research
04/17/2020

Transform and Tell: Entity-Aware News Image Captioning

We propose an end-to-end model which generates captions for images embed...

Please sign up or login with your details

Forgot password? Click here to reset