Towards causal generative scene models via competition of experts

04/27/2020
by   Julius von Kügelgen, et al.
4

Learning how to model complex scenes in a modular way with recombinable components is a pre-requisite for higher-order reasoning and acting in the physical world. However, current generative models lack the ability to capture the inherently compositional and layered nature of visual scenes. While recent work has made progress towards unsupervised learning of object-based scene representations, most models still maintain a global representation space (i.e., objects are not explicitly separated), and cannot generate scenes with novel object arrangement and depth ordering. Here, we present an alternative approach which uses an inductive bias encouraging modularity by training an ensemble of generative models (experts). During training, experts compete for explaining parts of a scene, and thus specialise on different object classes, with objects being identified as parts that re-occur across multiple scenes. Our model allows for controllable sampling of individual objects and recombination of experts in physically plausible ways. In contrast to other methods, depth layering and occlusion are handled correctly, moving this approach closer to a causal generative scene model. Experiments on simple toy data qualitatively demonstrate the conceptual advantages of the proposed approach.

READ FULL TEXT

page 2

page 10

page 11

page 18

research
10/13/2021

Unsupervised Object Learning via Common Fate

Learning generative object models from unlabelled videos is a long stand...
research
11/24/2020

GIRAFFE: Representing Scenes as Compositional Generative Neural Feature Fields

Deep generative models allow for photorealistic image synthesis at high ...
research
07/30/2019

GENESIS: Generative Scene Inference and Sampling with Object-Centric Latent Representations

Generative models are emerging as promising tools in robotics and reinfo...
research
10/31/2022

gCoRF: Generative Compositional Radiance Fields

3D generative models of objects enable photorealistic image synthesis wi...
research
07/02/2020

RELATE: Physically Plausible Multi-Object Scene Synthesis Using Structured Latent Spaces

We present RELATE, a model that learns to generate physically plausible ...
research
12/07/2020

Generating unseen complex scenes: are we there yet?

Although recent complex scene conditional generation models generate inc...
research
03/21/2022

Generating Fast and Slow: Scene Decomposition via Reconstruction

We consider the problem of segmenting scenes into constituent entities, ...

Please sign up or login with your details

Forgot password? Click here to reset