Room for improvement in automatic image description: an error analysis

04/13/2017
by   Emiel van Miltenburg, et al.
0

In recent years we have seen rapid and significant progress in automatic image description but what are the open problems in this area? Most work has been evaluated using text-based similarity metrics, which only indicate that there have been improvements, without explaining what has improved. In this paper, we present a detailed error analysis of the descriptions generated by a state-of-the-art attention-based model. Our analysis operates on two levels: first we check the descriptions for accuracy, and then we categorize the types of errors we observe in the inaccurate descriptions. We find only 20 descriptions are free from errors, and surprisingly that 26 the image. Finally, we manually correct the most frequently occurring error types (e.g. gender identification) to estimate the performance reward for addressing these errors, observing gains of 0.2--1 BLEU point per type.

READ FULL TEXT

page 2

page 6

page 7

page 8

research
06/15/2020

On the use of human reference data for evaluating automatic image descriptions

Automatic image description systems are commonly trained and evaluated u...
research
06/29/2021

Evaluation of Automated Image Descriptions for Visually Impaired Students

Illustrations are widely used in education, and sometimes, alternatives ...
research
07/07/2022

Improving Few-Shot Image Classification Using Machine- and User-Generated Natural Language Descriptions

Humans can obtain the knowledge of novel visual concepts from language d...
research
06/20/2016

Pragmatic factors in image description: the case of negations

We provide a qualitative analysis of the descriptions containing negatio...
research
07/06/2017

Cross-linguistic differences and similarities in image descriptions

Automatic image description systems are commonly trained and evaluated o...
research
05/29/2019

Ensuring Readability and Data-fidelity using Head-modifier Templates in Deep Type Description Generation

A type description is a succinct noun compound which helps human and mac...
research
06/13/2022

Knowledge Graph Construction and Its Application in Automatic Radiology Report Generation from Radiologist's Dictation

Conventionally, the radiologist prepares the diagnosis notes and shares ...

Please sign up or login with your details

Forgot password? Click here to reset