Learning Direct Optimization for Scene Understanding

12/18/2018
by Lukasz Romaszko, et al.

We introduce a Learning Direct Optimization method for the refinement of a latent variable model that describes an input image x. Our goal is to explain a single image x with a 3D computer graphics model having scene graph latent variables z (such as object appearance and camera position). Given a current estimate of z, we can render a prediction of the image g(z), which can be compared to the image x. The standard way to proceed is then to measure the error E(x, g(z)) between the two and use an optimizer to minimize that error. Our novel Learning Direct Optimization (LiDO) approach instead trains a Prediction Network to directly predict an update that corrects z, rather than minimizing the error with respect to z. Experiments show that LiDO converges rapidly because it does not need to search the error landscape, produces better solutions, and is able to handle the mismatch between the data and the fitted scene model. We apply LiDO to a realistic synthetic dataset and show that the method transfers well to real images.
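The contrast between minimizing E(x, g(z)) and predicting an update to z directly can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not the paper's implementation: the "renderer" g is a fixed linear map rather than a graphics engine, the error is a squared pixel difference, and a least-squares regressor on the residual x - g(z) stands in for the Prediction Network. The names `A`, `predict_update`, `refine_learned`, and `refine_gradient` are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy "renderer": a fixed linear map from 3 latents to 16 pixels.
A = rng.normal(size=(16, 3))

def g(z):
    """Render latent variables z (last axis of size 3) into a 16-pixel image."""
    return z @ A.T

# Training pairs: true latents, perturbed estimates, and the correction
# that would move each estimate onto the truth.
z_true_train = rng.normal(size=(500, 3))
z_est_train = z_true_train + rng.normal(size=(500, 3))
residuals = g(z_true_train) - g(z_est_train)   # what the predictor sees
targets = z_true_train - z_est_train           # the update it should output

# Stand-in for the Prediction Network: linear regression from residual to update.
W, *_ = np.linalg.lstsq(residuals, targets, rcond=None)

def predict_update(x, rendering):
    """Predict a correction to z from the image and the current rendering."""
    return (x - rendering) @ W

def refine_learned(x, z, steps=1):
    """Learned direct update: apply the predicted correction to z."""
    for _ in range(steps):
        z = z + predict_update(x, g(z))
    return z

def refine_gradient(x, z, steps=1):
    """Baseline: gradient descent on E(x, g(z)) = ||x - g(z)||^2."""
    lr = 0.5 / np.linalg.norm(A, 2) ** 2       # step size chosen for stability
    for _ in range(steps):
        grad = -2.0 * (x - g(z)) @ A           # dE/dz for the linear renderer
        z = z - lr * grad
    return z

# Compare both refinements on a fresh test image.
z_star = rng.normal(size=3)
x = g(z_star)
z0 = z_star + rng.normal(size=3)

lido_err = np.linalg.norm(refine_learned(x, z0, steps=1) - z_star)
gd_err = np.linalg.norm(refine_gradient(x, z0, steps=5) - z_star)
print(f"learned update, 1 step:    {lido_err:.2e}")
print(f"gradient descent, 5 steps: {gd_err:.2e}")
```

Because the toy renderer is linear, the regressor can recover the correction almost exactly in one step, so this sketch shows only the mechanics of the two update rules. It says nothing about how a trained network behaves on a real, nonlinear renderer.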


research
12/18/2019

SynSin: End-to-end View Synthesis from a Single Image

Single image view synthesis allows for the generation of new views of a ...
research
11/21/2017

Aperture Supervision for Monocular Depth Estimation

We present a novel method to train machine learning algorithms to estima...
research
02/17/2021

ShaRF: Shape-conditioned Radiance Fields from a Single View

We present a method for estimating neural scenes representations of obje...
research
04/04/2018

Stochastic Adversarial Video Prediction

Being able to predict what may happen in the future requires an in-depth...
research
11/29/2018

Learning to Separate Multiple Illuminants in a Single Image

We present a method to separate a single image captured under two illumi...
research
10/08/2020

Single-Image Camera Response Function Using Prediction Consistency and Gradual Refinement

A few methods have been proposed to estimate the CRF from a single image...
research
06/25/2020

Perspective Plane Program Induction from a Single Image

We study the inverse graphics problem of inferring a holistic representa...
