Improving Image Captioning by Leveraging Knowledge Graphs

01/25/2019
by   Yimin Zhou, et al.

We explore the use of knowledge graphs, which capture general or commonsense knowledge, to augment the information extracted from images by state-of-the-art image captioning methods. Our experiments on several benchmark data sets, such as MS COCO, measured by CIDEr-D, a performance metric for image captioning, show that variants of state-of-the-art image captioning methods that use information extracted from knowledge graphs can substantially outperform those that rely solely on information extracted from images.


research
05/26/2019

A Survey on Biomedical Image Captioning

Image captioning applied to biomedical images can assist and accelerate ...
research
09/25/2020

Are scene graphs good enough to improve Image Captioning?

Many top-performing image captioning models rely solely on object featur...
research
04/13/2023

A-CAP: Anticipation Captioning with Commonsense Knowledge

Humans possess the capacity to reason about the future based on a sparse...
research
08/25/2020

Protect, Show, Attend and Tell: Image Captioning Model with Ownership Protection

By and large, existing Intellectual Property Right (IPR) protection on d...
research
06/10/2021

Data augmentation to improve robustness of image captioning solutions

In this paper, we study the impact of motion blur, a common quality flaw...
research
03/30/2016

Rich Image Captioning in the Wild

We present an image caption system that addresses new challenges of auto...
research
05/07/2023

UIT-OpenViIC: A Novel Benchmark for Evaluating Image Captioning in Vietnamese

Image Captioning is one of the vision-language tasks that still interest...
