Curiosity-driven 3D Scene Structure from Single-image Self-supervision

12/02/2020
by   David Griffiths, et al.
2

Previous work has demonstrated learning isolated 3D objects (voxel grids, point clouds, meshes, etc.) from 2D-only self-supervision. We here set out to extend this to entire 3D scenes made out of multiple objects, including their location, orientation and type, and the scenes illumination. Once learned, we can map arbitrary 2D images to 3D scene structure. We analyze why analysis-by-synthesis-like losses for supervision of 3D scene structure using differentiable rendering is not practical, as it almost always gets stuck in local minima of visual ambiguities. This can be overcome by a novel form of training: we use an additional network to steer the optimization itself to explore the full gamut of possible solutions i.e. to be curious, and hence, to resolve those ambiguities and find workable minima. The resulting system converts 2D images of different virtual or real images into complete 3D scenes, learned only from 2D images of those scenes.

READ FULL TEXT

page 1

page 3

page 5

page 7

page 8

page 11

page 12

page 13

research
06/13/2020

Equivariant Neural Rendering

We propose a framework for learning neural scene representations directl...
research
03/30/2021

Repopulating Street Scenes

We present a framework for automatically reconfiguring images of street ...
research
12/08/2020

A Number Sense as an Emergent Property of the Manipulating Brain

The ability to understand and manipulate numbers and quantities emerges ...
research
11/27/2022

3inGAN: Learning a 3D Generative Model from Images of a Self-similar Scene

We introduce 3inGAN, an unconditional 3D generative model trained from 2...
research
02/28/2017

SceneSuggest: Context-driven 3D Scene Design

We present SceneSuggest: an interactive 3D scene design system providing...
research
04/18/2020

Example-Guided Image Synthesis across Arbitrary Scenes using Masked Spatial-Channel Attention and Self-Supervision

Example-guided image synthesis has recently been attempted to synthesize...
research
04/06/2020

Finding Your (3D) Center: 3D Object Detection Using a Learned Loss

Massive semantic labeling is readily available for 2D images, but much h...

Please sign up or login with your details

Forgot password? Click here to reset