ICECAP: Information Concentrated Entity-aware Image Captioning

08/04/2021
by Anwen Hu, et al.

Most current image captioning systems focus on describing general image content and lack the background knowledge needed to understand an image deeply, such as exact named entities or concrete events. In this work, we focus on the entity-aware news image captioning task, which aims to generate informative captions by leveraging the associated news article to provide background knowledge about the target image. However, because news articles are long, previous works use them only at the coarse article or sentence level, which is not fine-grained enough to identify relevant events or choose named entities accurately. To overcome these limitations, we propose an Information Concentrated Entity-aware news image CAPtioning (ICECAP) model, which progressively concentrates on relevant textual information within the corresponding news article, from the sentence level to the word level. Our model first performs coarse concentration on relevant sentences using a cross-modality retrieval model, and then generates captions by further concentrating on relevant words within those sentences. Extensive experiments on both the BreakingNews and GoodNews datasets demonstrate the effectiveness of our proposed method, which outperforms other state-of-the-art methods. The code of ICECAP is publicly available at https://github.com/HAWLYQ/ICECAP.
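
The coarse-to-fine idea described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation (see the repository linked above for that). It assumes hand-made 2-dimensional vectors in place of learned image and text embeddings, cosine similarity in place of the trained cross-modality retrieval model, and a single dot-product attention step in place of the caption decoder. All names (`top_k_sentences`, `word_attention`, `concentrate`) are hypothetical.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0

def mean_vector(vectors):
    """Element-wise mean; a stand-in for a learned sentence encoder."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]

def top_k_sentences(image_vec, sentence_vecs, k):
    """Coarse step: keep the k sentences most similar to the image."""
    scores = [cosine(image_vec, s) for s in sentence_vecs]
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:k])  # restore article order

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def word_attention(query_vec, word_vecs):
    """Fine step: dot-product attention weights over candidate words."""
    scores = [sum(q * w for q, w in zip(query_vec, wv)) for wv in word_vecs]
    return softmax(scores)

def concentrate(image_vec, sentences, k, query_vec):
    """Select sentences first, then weight the words inside them."""
    sentence_vecs = [mean_vector([vec for _, vec in s]) for s in sentences]
    selected = top_k_sentences(image_vec, sentence_vecs, k)
    words = [pair for i in selected for pair in sentences[i]]
    weights = word_attention(query_vec, [vec for _, vec in words])
    return selected, [(w, a) for (w, _), a in zip(words, weights)]

# Toy article: each sentence is a list of (word, made-up embedding) pairs.
sentences = [
    [("mayor", [0.9, 0.1]), ("speaks", [0.3, 0.2])],
    [("stocks", [0.0, 1.0]), ("fell", [0.1, 0.8])],
    [("city", [0.7, 0.3]), ("hall", [0.6, 0.2])],
]
image_vec = [1.0, 0.0]   # made-up image embedding
query_vec = [1.0, 0.0]   # made-up decoder state at one step

selected, weighted_words = concentrate(image_vec, sentences, 2, query_vec)
best_word = max(weighted_words, key=lambda p: p[1])[0]
print(selected, best_word)
```

In this sketch the second sentence is dropped at the coarse step, so its words never compete for attention at the fine step, which is the point of concentrating in two stages rather than attending over the whole article.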

research
10/08/2020

VisualNews: Benchmark and Challenges in Entity-aware Image Captioning

In this paper we propose VisualNews-Captioner, an entity-aware model for...
research
12/01/2022

Focus! Relevant and Sufficient Context Selection for News Image Captioning

News Image Captioning requires describing an image by leveraging additio...
research
07/26/2021

Boosting Entity-aware Image Captioning with Multi-modal Knowledge Graph

Entity-aware image captioning aims to describe named entities and events...
research
09/21/2022

Show, Interpret and Tell: Entity-aware Contextualised Image Captioning in Wikipedia

Humans exploit prior knowledge to describe images, and are able to adapt...
research
06/20/2019

Informative Image Captioning with External Sources of Information

An image caption should fluently present the essential information in a ...
research
11/25/2022

Aesthetically Relevant Image Captioning

Image aesthetic quality assessment (AQA) aims to assign numerical aesthe...
research
04/21/2018

Entity-aware Image Caption Generation

Image captioning approaches currently generate descriptions which lack s...
