On the use of human reference data for evaluating automatic image descriptions

06/15/2020
by Emiel van Miltenburg, et al.

Automatic image description systems are commonly trained and evaluated using crowdsourced, human-generated image descriptions. The best-performing system is then determined using some measure of similarity to the reference data (BLEU, Meteor, CIDEr, etc.). Thus, both the quality of the systems and the quality of the evaluation depend on the quality of the descriptions. As Section 2 will show, the quality of current image description datasets is insufficient. I argue that there is a need for more detailed guidelines that take into account the needs of visually impaired users, but also the feasibility of generating suitable descriptions. With high-quality data, evaluation of image description systems could use reference descriptions, but we should also look for alternatives.
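To make the dependence on reference quality concrete, the following is a minimal sketch of a reference-based similarity score. It computes only clipped n-gram precision, which is one ingredient of BLEU, and is not a full implementation of BLEU, Meteor, or CIDEr. The function names and the example sentences are hypothetical and are not taken from the paper or from any dataset.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return all contiguous n-grams of a token list as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def clipped_precision(candidate, references, n=1):
    """Fraction of candidate n-grams that also occur in some reference.

    Each candidate n-gram is credited at most as often as it appears in
    the single reference where it is most frequent (count clipping).
    Tokenization is naive lowercase whitespace splitting (an assumption).
    """
    cand_counts = Counter(ngrams(candidate.lower().split(), n))
    if not cand_counts:
        return 0.0

    # For every n-gram, record its highest count in any one reference.
    max_ref_counts = Counter()
    for ref in references:
        ref_counts = Counter(ngrams(ref.lower().split(), n))
        for gram, count in ref_counts.items():
            max_ref_counts[gram] = max(max_ref_counts[gram], count)

    matched = sum(min(count, max_ref_counts[gram])
                  for gram, count in cand_counts.items())
    return matched / sum(cand_counts.values())


# Hypothetical system output and two hypothetical reference sets.
candidate = "a dog runs on the grass"
detailed_refs = ["a brown dog runs on the grass",
                 "a dog is running across a lawn"]
vague_refs = ["an animal outside", "something in a field"]

print(clipped_precision(candidate, detailed_refs))
print(clipped_precision(candidate, vague_refs))
```

The same candidate description scores very differently depending on which references it is compared against, which is why a score of this kind can only be as informative as the reference descriptions themselves.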


research
07/22/2019

VIFIDEL: Evaluating the Visual Fidelity of Image Descriptions

We address the task of evaluating image description generation systems. ...
research
06/29/2021

Evaluation of Automated Image Descriptions for Visually Impaired Students

Illustrations are widely used in education, and sometimes, alternatives ...
research
04/13/2017

Room for improvement in automatic image description: an error analysis

In recent years we have seen rapid and significant progress in automatic...
research
04/26/2017

Punny Captions: Witty Wordplay in Image Descriptions

Wit is a quintessential form of rich inter-human interaction, and is oft...
research
05/14/2022

ACCoRD: A Multi-Document Approach to Generating Diverse Descriptions of Scientific Concepts

Systems that can automatically define unfamiliar terms hold the promise ...
research
05/24/2022

Face2Text revisited: Improved data set and baseline results

Current image description generation models do not transfer well to the ...
research
12/24/2017

Semi-automatic definite description annotation: a first report

Studies in Referring Expression Generation (REG) often make use of corpo...
