Multi-Object Representation Learning with Iterative Variational Inference

03/01/2019
by Klaus Greff, et al.

Human perception is structured around objects which form the basis for our higher-level cognition and impressive systematic generalization abilities. Yet most work on representation learning focuses on feature learning without even considering multiple objects, or treats segmentation as an (often supervised) preprocessing step. Instead, we argue for the importance of learning to segment and represent objects jointly. We demonstrate that, starting from the simple assumption that a scene is composed of multiple entities, it is possible to learn to segment images into interpretable objects with disentangled representations. Our method learns -- without supervision -- to inpaint occluded parts, and extrapolates to scenes with more objects and to unseen objects with novel feature combinations. We also show that, due to the use of iterative variational inference, our system is able to learn multi-modal posteriors for ambiguous inputs and extends naturally to sequences.
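
The abstract's starting assumption, that a scene is composed of multiple entities, can be illustrated with a toy pixel-wise mixture. The sketch below is not the authors' model: the softmax normalization of masks, the function names (`compose_scene`, `segment`), and the four-pixel example are assumptions chosen for illustration, and no inference or learning is performed. It only shows how per-component appearances and masks could combine into one image and how a segmentation could be read off the masks.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def compose_scene(component_values, mask_logits):
    """Blend K components into one image using per-pixel softmax masks.

    component_values[k][p]: value of pixel p as rendered by component k.
    mask_logits[k][p]: unnormalized score that pixel p belongs to component k.
    Returns the blended image and the normalized masks.
    """
    num_components = len(component_values)
    num_pixels = len(component_values[0])
    image = []
    masks = [[] for _ in range(num_components)]
    for p in range(num_pixels):
        # Masks compete per pixel, so they sum to 1 across components.
        weights = softmax([mask_logits[k][p] for k in range(num_components)])
        for k in range(num_components):
            masks[k].append(weights[k])
        image.append(
            sum(weights[k] * component_values[k][p] for k in range(num_components))
        )
    return image, masks


def segment(masks):
    """Assign each pixel to the component with the largest mask value."""
    num_pixels = len(masks[0])
    return [max(range(len(masks)), key=lambda k: masks[k][p]) for p in range(num_pixels)]


# Hypothetical 4-pixel "scene": a dark component on the left, a bright one on the right.
values = [[0.0, 0.0, 0.0, 0.0], [1.0, 1.0, 1.0, 1.0]]
logits = [[5.0, 5.0, -5.0, -5.0], [-5.0, -5.0, 5.0, 5.0]]
image, masks = compose_scene(values, logits)
labels = segment(masks)
```

Here `labels` assigns the first two pixels to component 0 and the last two to component 1. Because each component renders every pixel, a component can still carry values for pixels where its mask is near zero, which is one way to picture how a model of this kind might fill in occluded parts.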

research
09/25/2019

LAVAE: Disentangling Location and Appearance

We propose a probabilistic generative model for unsupervised learning of...
research
05/08/2023

The Treachery of Images: Bayesian Scene Keypoints for Deep Policy Learning in Robotic Manipulation

In policy learning for robotic manipulation, sample efficiency is of par...
research
11/21/2022

Compositional Scene Modeling with Global Object-Centric Representations

The appearance of the same object may vary in different scene images due...
research
04/25/2023

On the Generalization of Learned Structured Representations

Despite tremendous progress over the past decade, deep learning methods ...
research
09/02/2022

Object-based active inference

The world consists of objects: distinct entities possessing independent ...
research
11/19/2015

Binding via Reconstruction Clustering

Disentangled distributed representations of data are desirable for machi...
research
07/17/2020

AlignNet: Unsupervised Entity Alignment

Recently developed deep learning models are able to learn to segment sce...
