Bridging the Gap to Real-World Object-Centric Learning

09/29/2022
by   Maximilian Seitzer, et al.
38

Humans naturally decompose their environment into entities at the appropriate level of abstraction to act in the world. Allowing machine learning algorithms to derive this decomposition in an unsupervised way has become an important line of research. However, current methods are restricted to simulated data or require additional information in the form of motion or depth in order to successfully discover objects. In this work, we overcome this limitation by showing that reconstructing features from models trained in a self-supervised manner is a sufficient training signal for object-centric representations to arise in a fully unsupervised way. Our approach, DINOSAUR, significantly out-performs existing object-centric learning models on simulated data and is the first unsupervised object-centric model that scales to real world-datasets such as COCO and PASCAL VOC. DINOSAUR is conceptually simple and shows competitive performance compared to more involved pipelines from the computer vision literature.

READ FULL TEXT

page 6

page 7

page 9

page 16

page 23

page 24

page 25

page 26

research
06/07/2023

Object-Centric Learning for Real-World Videos by Predicting Temporal Feature Similarities

Unsupervised video-based object-centric learning is a promising avenue t...
research
05/18/2023

SlotDiffusion: Object-Centric Generative Modeling with Diffusion Models

Object-centric learning aims to represent visual data with a set of obje...
research
10/24/2021

DAG Card is the new Model Card

With the progressive commoditization of modeling capabilities, data-cent...
research
10/07/2021

Unsupervised Image Decomposition with Phase-Correlation Networks

The ability to decompose scenes into their object components is a desire...
research
04/20/2021

GENESIS-V2: Inferring Unordered Object Representations without Iterative Refinement

Advances in object-centric generative models (OCGMs) have culminated in ...
research
04/05/2022

Complex-Valued Autoencoders for Object Discovery

Object-centric representations form the basis of human perception and en...
research
06/01/2023

Rotating Features for Object Discovery

The binding problem in human cognition, concerning how the brain represe...

Please sign up or login with your details

Forgot password? Click here to reset