A Dataset and Baselines for Visual Question Answering on Art

08/28/2020
by Noa Garcia, et al.

Answering questions about art pieces (paintings) is a difficult task, as it requires understanding not only the visual information shown in the picture, but also the contextual knowledge acquired through the study of art history. In this work, we introduce our first attempt towards building a new dataset, coined AQUA (Art QUestion Answering). The question-answer (QA) pairs are automatically generated using state-of-the-art question generation methods based on paintings and comments provided in an existing art understanding dataset. The QA pairs are then cleaned by crowdsourcing workers with respect to their grammatical correctness, answerability, and the correctness of their answers. Our dataset inherently consists of visual (painting-based) and knowledge (comment-based) questions. We also present a two-branch model as a baseline, in which the visual and knowledge questions are handled independently. We extensively compare our baseline model against state-of-the-art models for question answering, and we provide a comprehensive study of the challenges and potential future directions for visual question answering on art.
