Learning to Infer 3D Object Models from Images

06/11/2020
by Chang Chen, et al.

A crucial ability of human intelligence is to build up models of individual 3D objects from partial scene observations. Recent work has enabled unsupervised 3D representation learning at the scene level, yet learning to decompose a 3D scene into 3D objects and build their individual models from multi-object scene images remains elusive. In this paper, we propose a probabilistic generative model for learning to build modular and compositional 3D object models from observations of a multi-object scene. The proposed model can (i) infer 3D object representations by learning to search for and group object areas and (ii) render, from an arbitrary viewpoint, not only individual objects but also the full scene by compositing the objects. The entire learning process is unsupervised and end-to-end. We also demonstrate that the learned representation permits object-wise manipulation and novel scene generation, and generalizes to various settings.


research
12/07/2021

Unsupervised Learning of Compositional Scene Representations from Multiple Unspecified Viewpoints

Visual scenes are extremely rich in diversity, not only because there ar...
research
03/08/2021

Unsupervised Object-Based Transition Models for 3D Partially Observable Environments

We present a slot-wise, object-based transition model that decomposes a ...
research
08/05/2002

Probabilistic Search for Object Segmentation and Recognition

The problem of searching for a model-based scene interpretation is analy...
research
11/21/2022

Compositional Scene Modeling with Global Object-Centric Representations

The appearance of the same object may vary in different scene images due...
research
11/03/2020

Learning 3D Dynamic Scene Representations for Robot Manipulation

3D scene representation for robot manipulation should capture three key ...
research
03/19/2021

Knowledge-Guided Object Discovery with Acquired Deep Impressions

We present a framework called Acquired Deep Impressions (ADI) which cont...
research
01/08/2020

SPACE: Unsupervised Object-Oriented Scene Representation via Spatial Attention and Decomposition

The ability to decompose complex multi-object scenes into meaningful abs...
