Relational Neural Expectation Maximization: Unsupervised Discovery of Objects and their Interactions

02/28/2018
by   Sjoerd van Steenkiste, et al.
0

Common-sense physical reasoning is an essential ingredient for any intelligent agent operating in the real-world. For example, it can be used to simulate the environment, or to infer the state of parts of the world that are currently unobserved. In order to match real-world conditions this causal knowledge must be learned without access to supervised data. To address this problem we present a novel method that learns to discover objects and model their physical interactions from raw visual images in a purely unsupervised fashion. It incorporates prior knowledge about the compositional nature of human perception to factor interactions between object-pairs and learn efficiently. On videos of bouncing balls we show the superior modelling capabilities of our method compared to other unsupervised neural approaches that do not incorporate such prior knowledge. We demonstrate its ability to handle occlusion and show that it can extrapolate learned knowledge to scenes with different numbers of objects.

READ FULL TEXT

page 3

page 6

page 7

page 9

research
10/07/2020

Hierarchical Relational Inference

Common-sense physical reasoning in the real world requires learning abou...
research
06/03/2019

A Perspective on Objects and Systematic Generalization in Model-Based RL

In order to meet the diverse challenges in solving many real-world probl...
research
07/24/2020

Unsupervised Discovery of 3D Physical Objects from Video

We study the problem of unsupervised physical object discovery. Unlike e...
research
05/02/2022

ComPhy: Compositional Physical Reasoning of Objects and Events from Videos

Objects' motions in nature are governed by complex interactions and thei...
research
08/11/2017

Neural Expectation Maximization

Many real world tasks such as reasoning and physical interaction require...
research
05/14/2018

Unsupervised Intuitive Physics from Visual Observations

While learning models of intuitive physics is an increasingly active are...
research
06/13/2017

The "something something" video database for learning and evaluating visual common sense

Neural networks trained on datasets such as ImageNet have led to major a...

Please sign up or login with your details

Forgot password? Click here to reset