HoloGAN: Unsupervised learning of 3D representations from natural images

04/02/2019
by   Thu Nguyen-Phuoc, et al.
68

We propose a novel generative adversarial network (GAN) for the task of unsupervised learning of 3D representations from natural images. Most generative models rely on 2D kernels to generate images and make few assumptions about the 3D world. These models therefore tend to create blurry images or artefacts in tasks that require a strong 3D understanding, such as novel-view synthesis. HoloGAN instead learns a 3D representation of the world, and to render this representation in a realistic manner. Unlike other GANs, HoloGAN provides explicit control over the pose of generated objects through rigid-body transformations of the learnt 3D features. Our experiments show that using explicit 3D features enables HoloGAN to disentangle 3D pose and identity, which is further decomposed into shape and appearance, while still being able to generate images with similar or higher visual quality than other generative models. HoloGAN can be trained end-to-end from unlabelled 2D images only. Particularly, we do not require pose labels, 3D shapes, or multiple views of the same objects. This shows that HoloGAN is the first generative model that learns 3D representations from natural images in an entirely unsupervised manner.

READ FULL TEXT

page 1

page 3

page 5

page 6

page 7

page 8

research
03/05/2017

LR-GAN: Layered Recursive Generative Adversarial Networks for Image Generation

We present LR-GAN: an adversarial image generation model which takes sce...
research
07/03/2016

Unsupervised Learning of 3D Structure from Images

A key goal of computer vision is to recover the underlying 3D structure ...
research
09/12/2014

Unsupervised learning of clutter-resistant visual representations from natural videos

Populations of neurons in inferotemporal cortex (IT) maintain an explici...
research
04/19/2022

Unsupervised Learning of Efficient Geometry-Aware Neural Articulated Representations

We propose an unsupervised method for 3D geometry-aware representation l...
research
06/13/2022

AR-NeRF: Unsupervised Learning of Depth and Defocus Effects from Natural Images with Aperture Rendering Neural Radiance Fields

Fully unsupervised 3D representation learning has gained attention owing...
research
10/11/2020

Resolution Dependant GAN Interpolation for Controllable Image Synthesis Between Domains

GANs can generate photo-realistic images from the domain of their traini...
research
06/24/2021

Unsupervised Learning of Depth and Depth-of-Field Effect from Natural Images with Aperture Rendering Generative Adversarial Networks

Understanding the 3D world from 2D projected natural images is a fundame...

Please sign up or login with your details

Forgot password? Click here to reset