QLEVR: A Diagnostic Dataset for Quantificational Language and Elementary Visual Reasoning

05/06/2022
by   Zechen Li, et al.
12

Synthetic datasets have successfully been used to probe visual question-answering datasets for their reasoning abilities. CLEVR (johnson2017clevr), for example, tests a range of visual reasoning abilities. The questions in CLEVR focus on comparisons of shapes, colors, and sizes, numerical reasoning, and existence claims. This paper introduces a minimally biased, diagnostic visual question-answering dataset, QLEVR, that goes beyond existential and numerical quantification and focus on more complex quantifiers and their combinations, e.g., asking whether there are more than two red balls that are smaller than at least three blue balls in an image. We describe how the dataset was created and present a first evaluation of state-of-the-art visual question-answering models, showing that QLEVR presents a formidable challenge to our current models. Code and Dataset are available at https://github.com/zechenli03/QLEVR

READ FULL TEXT

page 1

page 3

page 12

page 13

page 14

page 15

page 16

page 17

research
12/20/2016

CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning

When building artificial intelligence systems that can reason and answer...
research
03/14/2023

Evaluation of ChatGPT as a Question Answering System for Answering Complex Questions

ChatGPT is a powerful large language model (LLM) that has made remarkabl...
research
11/21/2019

Temporal Reasoning via Audio Question Answering

Multimodal question answering tasks can be used as proxy tasks to study ...
research
02/24/2022

Measuring CLEVRness: Blackbox testing of Visual Reasoning Models

How can we measure the reasoning capabilities of intelligence systems? V...
research
09/06/2018

Cascaded Mutual Modulation for Visual Reasoning

Visual reasoning is a special visual question answering problem that is ...
research
05/01/2020

Diverse Visuo-Lingustic Question Answering (DVLQA) Challenge

Existing question answering datasets mostly contain homogeneous contexts...
research
03/27/2022

MedMCQA : A Large-scale Multi-Subject Multi-Choice Dataset for Medical domain Question Answering

This paper introduces MedMCQA, a new large-scale, Multiple-Choice Questi...

Please sign up or login with your details

Forgot password? Click here to reset