Spotlight Attention: Robust Object-Centric Learning With a Spatial Locality Prior

05/31/2023
by Ayush Chakravarthy, et al.

The aim of object-centric vision is to construct an explicit representation of the objects in a scene. This representation is obtained via a set of interchangeable modules, called slots or object files, that compete for local patches of an image. The competition has only a weak inductive bias to preserve spatial continuity; consequently, one slot may claim patches scattered diffusely throughout the image. In contrast, the inductive bias of human vision is strong, to the degree that attention has classically been described with a spotlight metaphor. We incorporate a spatial-locality prior into state-of-the-art object-centric vision models and obtain significant improvements in segmenting objects in both synthetic and real-world datasets. As in human visual attention, the combination of image content and spatial constraints yields robust unsupervised object-centric learning, including reduced sensitivity to model hyperparameters.
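To make the idea of slots competing for patches under a spatial-locality prior concrete, here is a minimal sketch. It is not the authors' implementation: the abstract does not specify the form of the prior, so the isotropic Gaussian bias, the function name `spotlight_attention`, and the parameters `slot_xy` and `sigma` are all assumptions chosen for illustration. Each patch distributes its attention over the slots with a softmax, and each slot's score is penalized by the squared distance between the patch and that slot's assumed "spotlight" center.

```python
import math


def spotlight_attention(content_logits, patch_xy, slot_xy, sigma=0.25):
    """Softmax over slots of (content logit + Gaussian spatial bias).

    content_logits[k][n]: content-based score of slot k for patch n.
    patch_xy[n]: (x, y) position of patch n.
    slot_xy[k]: assumed (x, y) spotlight center of slot k (hypothetical).
    sigma: assumed width of the spotlight (hypothetical).
    Returns attn[k][n]; for every patch n the values sum to 1 over slots k.
    """
    num_slots = len(content_logits)
    num_patches = len(patch_xy)
    attn = [[0.0] * num_patches for _ in range(num_slots)]
    for n, (px, py) in enumerate(patch_xy):
        scores = []
        for k, (cx, cy) in enumerate(slot_xy):
            sq_dist = (px - cx) ** 2 + (py - cy) ** 2
            # Log of an unnormalized Gaussian prior centered on the slot.
            spatial_bias = -sq_dist / (2.0 * sigma ** 2)
            scores.append(content_logits[k][n] + spatial_bias)
        # Numerically stable softmax across slots: slots compete per patch.
        top = max(scores)
        exps = [math.exp(s - top) for s in scores]
        total = sum(exps)
        for k in range(num_slots):
            attn[k][n] = exps[k] / total
    return attn


# Toy usage: two slots, three patches, uninformative content scores.
patch_xy = [(0.0, 0.0), (0.1, 0.0), (1.0, 1.0)]
slot_xy = [(0.0, 0.0), (1.0, 1.0)]
flat_logits = [[0.0, 0.0, 0.0], [0.0, 0.0, 0.0]]

local_attn = spotlight_attention(flat_logits, patch_xy, slot_xy, sigma=0.25)
diffuse_attn = spotlight_attention(flat_logits, patch_xy, slot_xy, sigma=1e6)
```

With uninformative content scores, a narrow `sigma` assigns each patch almost entirely to the nearest slot, while a very large `sigma` effectively switches the prior off and leaves the patches split evenly, which is the diffuse behavior the abstract attributes to content-only competition.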


research
03/31/2023

Shepherding Slots to Objects: Towards Stable and Robust Object-Centric Learning

Object-centric learning (OCL) aspires general and compositional understa...
research
12/31/2020

Language-Mediated, Object-Centric Representation Learning

We present Language-mediated, Object-centric Representation Learning (LO...
research
06/07/2023

Object-Centric Learning for Real-World Videos by Predicting Temporal Feature Similarities

Unsupervised video-based object-centric learning is a promising avenue t...
research
07/19/2021

Structured World Belief for Reinforcement Learning in POMDP

Object-centric world models provide structured representation of the sce...
research
10/17/2022

Unsupervised Object-Centric Learning with Bi-Level Optimized Query Slot Attention

The ability to decompose complex natural scenes into meaningful object-c...
research
01/12/2023

Scene-centric vs. Object-centric Image-Text Cross-modal Retrieval: A Reproducibility Study

Most approaches to cross-modal retrieval (CMR) focus either on object-ce...
research
04/20/2021

GENESIS-V2: Inferring Unordered Object Representations without Iterative Refinement

Advances in object-centric generative models (OCGMs) have culminated in ...
