Learning to generate new indoor scenes

12/10/2019
by   Pulak Purkait, et al.
16

Deep generative models have been used in recent years to learn coherent latent representations in order to synthesize high quality images. In this work we propose a neural network to learn a generative model for sampling consistent indoor scene layouts. Our method learns the co-occurrences, and appearance parameters such as shape and pose, for different objects categories through a grammar-based auto-encoder, resulting in a compact and accurate representation for scene layouts. In contrast to existing grammar-based methods with a user-specified grammar, we construct the grammar automatically by extracting a set of production rules on reasoning about object co-occurrences in training data. The extracted grammar is able to represent a scene by an augmented parse tree. The proposed auto-encoder encodes these parse trees to a latent code, and decodes the latent code to a parse-tree, thereby ensuring the generated scene is always valid. We experimentally demonstrate that the proposed auto-encoder learns not only to generate valid scenes (i.e. the arrangements and appearances of objects), but it also learns coherent latent representations where nearby latent samples decode to similar scene outputs. The obtained generative model is applicable to several computer vision tasks such as 3D pose and layout estimation from RGB-D data.

READ FULL TEXT

page 8

page 17

page 18

page 20

page 21

research
03/06/2017

Grammar Variational Autoencoder

Deep generative models have been wildly successful at learning coherent ...
research
03/08/2019

Auto-Encoding Progressive Generative Adversarial Networks For 3D Multi Object Scenes

3D multi object generative models allow us to synthesize a large range o...
research
04/19/2023

NeuralField-LDM: Scene Generation with Hierarchical Latent Diffusion Models

Automatically generating high-quality real world 3D scenes is of enormou...
research
11/08/2012

3D Scene Grammar for Parsing RGB-D Pointclouds

We pose 3D scene-understanding as a problem of parsing in a grammar. A g...
research
04/25/2019

Meta-Sim: Learning to Generate Synthetic Datasets

Training models to high-end performance requires availability of large l...
research
08/11/2020

Adversarial Generative Grammars for Human Activity Prediction

In this paper we propose an adversarial generative grammar model for fut...
research
01/22/2020

From abstract items to latent spaces to observed data and back: Compositional Variational Auto-Encoder

Conditional Generative Models are now acknowledged an essential tool in ...

Please sign up or login with your details

Forgot password? Click here to reset