The Color of the Cat is Gray: 1 Million Full-Sentences Visual Question Answering (FSVQA)

09/21/2016
by   Andrew Shin, et al.
0

Visual Question Answering (VQA) task has showcased a new stage of interaction between language and vision, two of the most pivotal components of artificial intelligence. However, it has mostly focused on generating short and repetitive answers, mostly single words, which fall short of rich linguistic capabilities of humans. We introduce Full-Sentence Visual Question Answering (FSVQA) dataset, consisting of nearly 1 million pairs of questions and full-sentence answers for images, built by applying a number of rule-based natural language processing techniques to original VQA dataset and captions in the MS COCO dataset. This poses many additional complexities to conventional VQA task, and we provide a baseline for approaching and evaluating the task, on top of which we invite the research community to build further improvements.

READ FULL TEXT

page 1

page 6

research
05/03/2015

VQA: Visual Question Answering

We propose the task of free-form and open-ended Visual Question Answerin...
research
11/23/2019

Unsupervised Keyword Extraction for Full-sentence VQA

In existing studies on Visual Question Answering (VQA), which aims to tr...
research
04/06/2016

A Focused Dynamic Attention Model for Visual Question Answering

Visual Question and Answering (VQA) problems are attracting increasing i...
research
07/18/2023

Generative Visual Question Answering

Multi-modal tasks involving vision and language in deep learning continu...
research
08/15/2017

VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation

Rich and dense human labeled datasets are among the main enabling factor...
research
08/31/2016

Measuring Machine Intelligence Through Visual Question Answering

As machines have become more intelligent, there has been a renewed inter...
research
07/03/2020

Visual Question Answering as a Multi-Task Problem

Visual Question Answering(VQA) is a highly complex problem set, relying ...

Please sign up or login with your details

Forgot password? Click here to reset