RAVEN: A Dataset for Relational and Analogical Visual rEasoNing

03/07/2019
by   Chi Zhang, et al.
0

Dramatic progress has been witnessed in basic vision tasks involving low-level perception, such as object recognition, detection, and tracking. Unfortunately, there is still an enormous performance gap between artificial vision systems and human intelligence in terms of higher-level vision problems, especially ones involving reasoning. Earlier attempts in equipping machines with high-level reasoning have hovered around Visual Question Answering (VQA), one typical task associating vision and language understanding. In this work, we propose a new dataset, built in the context of Raven's Progressive Matrices (RPM) and aimed at lifting machine intelligence by associating vision with structural, relational, and analogical reasoning in a hierarchical representation. Unlike previous works in measuring abstract reasoning using RPM, we establish a semantic link between vision and reasoning by providing structure representation. This addition enables a new type of abstract reasoning by jointly operating on the structure representation. Machine reasoning ability using modern computer vision is evaluated in this newly proposed dataset. Additionally, we also provide human performance as a reference. Finally, we show consistent improvement across all models by incorporating a simple neural module that combines visual understanding and structure reasoning.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/11/2015

Visual7W: Grounded Question Answering in Images

We have seen great progress in basic perceptual tasks such as object rec...
research
07/29/2019

V-PROM: A Benchmark for Visual Reasoning Using Visual Progressive Matrices

One of the primary challenges faced by deep learning is the degree to wh...
research
10/04/2019

Few-Shot Abstract Visual Reasoning With Spectral Features

We present an image preprocessing technique capable of improving the per...
research
04/25/2020

Machine Number Sense: A Dataset of Visual Arithmetic Problems for Abstract and Relational Reasoning

As a comprehensive indicator of mathematical thinking and intelligence, ...
research
06/11/2020

Visualizing and Understanding Vision System

How the human vision system addresses the object identity-preserving rec...
research
02/09/2018

Not-So-CLEVR: Visual Relations Strain Feedforward Neural Networks

The robust and efficient recognition of visual relations in images is a ...
research
09/24/2022

Deep Neural Networks for Visual Reasoning

Visual perception and language understanding are - fundamental component...

Please sign up or login with your details

Forgot password? Click here to reset