BlockGAN: Learning 3D Object-aware Scene Representations from Unlabelled Images

02/20/2020
by   Thu Nguyen-Phuoc, et al.
44

We present BlockGAN, an image generative model that learns object-aware 3D scene representations directly from unlabelled 2D images. Current work on scene representation learning either ignores scene background or treats the whole scene as one object. Meanwhile, work that considers scene compositionality treats scene objects only as image patches or 2D layers with alpha maps. Inspired by the computer graphics pipeline, we design BlockGAN to learn to first generate 3D features of background and foreground objects, then combine them into 3D features for the wholes cene, and finally render them into realistic images. This allows BlockGAN to reason over occlusion and interaction between objects' appearance, such as shadow and lighting, and provides control over each object's 3D pose and identity, while maintaining image realism. BlockGAN is trained end-to-end, using only unlabelled single images, without the need for 3D geometry, pose labels, object masks, or multiple views of the same scene. Our experiments show that using explicit 3D features to represent objects allows BlockGAN to learn disentangled representations both in terms of objects (foreground and background) and their properties (pose and identity).

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

page 9

page 10

page 13

research
11/10/2022

DisPositioNet: Disentangled Pose and Identity in Semantic Image Manipulation

Graph representation of objects and their relations in a scene, known as...
research
04/19/2022

Unsupervised Learning of Efficient Geometry-Aware Neural Articulated Representations

We propose an unsupervised method for 3D geometry-aware representation l...
research
11/06/2020

Disentangling 3D Prototypical Networks For Few-Shot Concept Learning

We present neural architectures that disentangle RGB-D images into objec...
research
05/29/2019

Emergence of Object Segmentation in Perturbed Generative Models

We introduce a novel framework to build a model that can learn how to se...
research
03/11/2015

Deep Convolutional Inverse Graphics Network

This paper presents the Deep Convolution Inverse Graphics Network (DC-IG...
research
11/03/2018

Geometry-Aware Recurrent Neural Networks for Active Visual Recognition

We present recurrent geometry-aware neural networks that integrate visua...
research
09/17/2019

re-OBJ: Jointly Learning the Foreground and Background for Object Instance Re-identification

Conventional approaches to object instance re-identification rely on mat...

Please sign up or login with your details

Forgot password? Click here to reset