Deep Convolutional Inverse Graphics Network

03/11/2015
by   Tejas D. Kulkarni, et al.
0

This paper presents the Deep Convolution Inverse Graphics Network (DC-IGN), a model that learns an interpretable representation of images. This representation is disentangled with respect to transformations such as out-of-plane rotations and lighting variations. The DC-IGN model is composed of multiple layers of convolution and de-convolution operators and is trained using the Stochastic Gradient Variational Bayes (SGVB) algorithm. We propose a training procedure to encourage neurons in the graphics code layer to represent a specific transformation (e.g. pose or light). Given a single input image, our model can generate new images of the same object with variations in pose and lighting. We present qualitative and quantitative results of the model's efficacy at learning a 3D rendering engine.

READ FULL TEXT

page 5

page 6

research
10/12/2016

Deep disentangled representations for volumetric reconstruction

We introduce a convolutional neural network for inferring a compact dise...
research
02/20/2020

BlockGAN: Learning 3D Object-aware Scene Representations from Unlabelled Images

We present BlockGAN, an image generative model that learns object-aware ...
research
07/18/2022

I2I: Image to Icosahedral Projection for SO(3) Object Reasoning from Single-View Images

Reasoning about 3D objects based on 2D images is challenging due to larg...
research
10/18/2020

Image GANs meet Differentiable Rendering for Inverse Graphics and Interpretable 3D Neural Rendering

Differentiable rendering has paved the way to training neural networks t...
research
05/10/2021

Robust Training Using Natural Transformation

Previous robustness approaches for deep learning models such as data aug...
research
05/13/2021

SyntheticFur dataset for neural rendering

We introduce a new dataset called SyntheticFur built specifically for ma...
research
06/29/2020

Explainable 3D Convolutional Neural Networks by Learning Temporal Transformations

In this paper we introduce the temporally factorized 3D convolution (3TC...

Please sign up or login with your details

Forgot password? Click here to reset