Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures

01/15/2016
by Raffaella Bernardi, et al.

Automatic description generation from natural images is a challenging problem that has recently received considerable interest from the computer vision and natural language processing communities. In this survey, we classify the existing approaches based on how they conceptualize this problem, viz., models that cast description as either a generation problem or a retrieval problem over a visual or multimodal representational space. We provide a detailed review of existing models, highlighting their advantages and disadvantages. Moreover, we give an overview of the benchmark image datasets and the evaluation measures that have been developed to assess the quality of machine-generated image descriptions. Finally, we extrapolate future directions in the area of automatic image description generation.


