Attention on Abstract Visual Reasoning

11/14/2019
by   Lukas Hahne, et al.
13

Attention mechanisms have been boosting the performance of deep learning models on a wide range of applications, ranging from speech understanding to program induction. However, despite experiments from psychology which suggest that attention plays an essential role in visual reasoning, the full potential of attention mechanisms has so far not been explored to solve abstract cognitive tasks on image data. In this work, we propose a hybrid network architecture, grounded on self-attention and relational reasoning. We call this new model Attention Relation Network (ARNe). ARNe combines features from the recently introduced Transformer and the Wild Relation Network (WReN). We test ARNe on the Procedurally Generated Matrices (PGMs) datasets for abstract visual reasoning. ARNe excels the WReN model on this task by 11.28 ppt. Relational concepts between objects are efficiently learned demanding only 35 training samples to surpass reported accuracy of the base line model. Our proposed hybrid model, represents an alternative on learning abstract relations using self-attention and demonstrates that the Transformer network is also well suited for abstract visual reasoning.

READ FULL TEXT

page 2

page 3

page 8

page 15

research
06/26/2023

PhD Thesis: Exploring the role of (self-)attention in cognitive and computer vision architecture

We investigate the role of attention and memory in complex reasoning tas...
research
11/29/2021

Recurrent Vision Transformer for Solving Visual Reasoning Problems

Although convolutional neural networks (CNNs) showed remarkable results ...
research
08/08/2021

Understanding the computational demands underlying visual reasoning

Visual understanding requires comprehending complex visual relations bet...
research
04/25/2020

Machine Number Sense: A Dataset of Visual Arithmetic Problems for Abstract and Relational Reasoning

As a comprehensive indicator of mathematical thinking and intelligence, ...
research
04/14/2023

The role of object-centric representations, guided attention, and external memory on generalizing visual relations

Visual reasoning is a long-term goal of vision research. In the last dec...
research
10/12/2021

Dynamic Inference with Neural Interpreters

Modern neural network architectures can leverage large amounts of data t...
research
06/05/2023

Infusing Lattice Symmetry Priors in Attention Mechanisms for Sample-Efficient Abstract Geometric Reasoning

The Abstraction and Reasoning Corpus (ARC) (Chollet, 2019) and its most ...

Please sign up or login with your details

Forgot password? Click here to reset