Learning to reason over visual objects

03/03/2023
by Shanka Subhra Mondal, et al.

A core component of human intelligence is the ability to identify abstract patterns inherent in complex, high-dimensional perceptual data, as exemplified by visual reasoning tasks such as Raven's Progressive Matrices (RPM). Motivated by the goal of designing AI systems with this capacity, recent work has focused on evaluating whether neural networks can learn to solve RPM-like problems. Previous work has generally found that strong performance on these problems requires the incorporation of inductive biases that are specific to the RPM problem format, raising the question of whether such models might be more broadly useful. Here, we investigated the extent to which a general-purpose mechanism for processing visual scenes in terms of objects might help promote abstract visual reasoning. We found that a simple model, consisting only of an object-centric encoder and a transformer reasoning module, achieved state-of-the-art results on both of two challenging RPM-like benchmarks (PGM and I-RAVEN), as well as a novel benchmark with greater visual complexity (CLEVR-Matrices). These results suggest that an inductive bias for object-centric processing may be a key component of abstract visual reasoning, obviating the need for problem-specific inductive biases.


