Solving Reasoning Tasks with a Slot Transformer

10/20/2022
by   Ryan Faulkner, et al.
0

The ability to carve the world into useful abstractions in order to reason about time and space is a crucial component of intelligence. In order to successfully perceive and act effectively using senses we must parse and compress large amounts of information for further downstream reasoning to take place, allowing increasingly complex concepts to emerge. If there is any hope to scale representation learning methods to work with real world scenes and temporal dynamics then there must be a way to learn accurate, concise, and composable abstractions across time. We present the Slot Transformer, an architecture that leverages slot attention, transformers and iterative variational inference on video scene data to infer such representations. We evaluate the Slot Transformer on CLEVRER, Kinetics-600 and CATER datesets and demonstrate that the approach allows us to develop robust modeling and reasoning around complex behaviours as well as scores on these datasets that compare favourably to existing baselines. Finally we evaluate the effectiveness of key components of the architecture, the model's representational capacity and its ability to predict from incomplete input.

READ FULL TEXT

page 7

page 8

page 10

page 18

page 19

research
06/12/2023

Slot-VAE: Object-Centric Scene Generation with Slot Attention

Slot attention has shown remarkable object-centric representation learni...
research
03/25/2022

Unsupervised Learning of Temporal Abstractions with Slot-based Transformers

The discovery of reusable sub-routines simplifies decision-making and pl...
research
10/17/2022

Unsupervised Object-Centric Learning with Bi-Level Optimized Query Slot Attention

The ability to decompose complex natural scenes into meaningful object-c...
research
11/02/2022

Neural Block-Slot Representations

In this paper, we propose a novel object-centric representation, called ...
research
07/23/2021

Constellation: Learning relational abstractions over objects for compositional imagination

Learning structured representations of visual scenes is currently a majo...
research
02/21/2023

Reusable Slotwise Mechanisms

Agents that can understand and reason over the dynamics of objects can h...
research
02/14/2023

Graph schemas as abstractions for transfer learning, inference, and planning

We propose schemas as a model for abstractions that can be used for rapi...

Please sign up or login with your details

Forgot password? Click here to reset