A Survey of Current Datasets for Vision and Language Research

06/23/2015
by   Francis Ferraro, et al.
0

Integrating vision and language has long been a dream in work on artificial intelligence (AI). In the past two years, we have witnessed an explosion of work that brings together vision and language from images to videos and beyond. The available corpora have played a crucial role in advancing this area of research. In this paper, we propose a set of quality metrics for evaluating and analyzing the vision language datasets and categorize them accordingly. Our analyses show that the most recent datasets have been using more complex language and more abstract concepts, however, there are different strengths and weaknesses in each.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/03/2023

Witgenstein's influence on artificial intelligence

We examine how much of the contemporary progress in artificial intellige...
research
05/08/2023

ChatGPT: Vision and Challenges

Artificial intelligence (AI) and machine learning have changed the natur...
research
06/26/2021

Core Challenges in Embodied Vision-Language Planning

Recent advances in the areas of multimodal machine learning and artifici...
research
12/26/2022

VQA and Visual Reasoning: An Overview of Recent Datasets, Methods and Challenges

Artificial Intelligence (AI) and its applications have sparked extraordi...
research
01/13/2022

Fantastic Data and How to Query Them

It is commonly acknowledged that the availability of the huge amount of ...
research
11/28/2021

Explore the Potential Performance of Vision-and-Language Navigation Model: a Snapshot Ensemble Method

Vision-and-Language Navigation (VLN) is a challenging task in the field ...
research
05/29/2020

Beyond Leaderboards: A survey of methods for revealing weaknesses in Natural Language Inference data and models

Recent years have seen a growing number of publications that analyse Nat...

Please sign up or login with your details

Forgot password? Click here to reset