Face2Text revisited: Improved data set and baseline results

05/24/2022
by   Marc Tanti, et al.
0

Current image description generation models do not transfer well to the task of describing human faces. To encourage the development of more human-focused descriptions, we developed a new data set of facial descriptions based on the CelebA image data set. We describe the properties of this data set, and present results from a face description generator trained on it, which explores the feasibility of using transfer learning from VGGFace/ResNet CNNs. Comparisons are drawn through both automated metrics and human evaluation by 76 English-speaking participants. The descriptions generated by the VGGFace-LSTM + Attention model are closest to the ground truth according to human evaluation whilst the ResNet-LSTM + Attention model obtained the highest CIDEr and CIDEr-D results (1.252 and 0.686 respectively). Together, the new data set and these experimental results provide data and baselines for future work in this area.

READ FULL TEXT
research
06/15/2020

On the use of human reference data for evaluating automatic image descriptions

Automatic image description systems are commonly trained and evaluated u...
research
07/22/2019

VIFIDEL: Evaluating the Visual Fidelity of Image Descriptions

We address the task of evaluating image description generation systems. ...
research
05/21/2022

Context Matters for Image Descriptions for Accessibility: Challenges for Referenceless Evaluation Metrics

Few images on the Web receive alt-text descriptions that would make them...
research
11/20/2014

CIDEr: Consensus-based Image Description Evaluation

Automatically describing an image with a sentence is a long-standing cha...
research
04/26/2017

Punny Captions: Witty Wordplay in Image Descriptions

Wit is a quintessential form of rich inter-human interaction, and is oft...
research
11/07/2021

NarrationBot and InfoBot: A Hybrid System for Automated Video Description

Video accessibility is crucial for blind and low vision users for equita...
research
10/26/2020

Open Smartphone Data for Structured Mobility and Utilization Analysis in Ubiquitous Systems

The development and evaluation of new data mining methods for ubiquitous...

Please sign up or login with your details

Forgot password? Click here to reset