Multi-layer Depth and Epipolar Feature Transformers for 3D Scene Reconstruction

02/18/2019
by Daeyun Shin, et al.

We tackle the problem of automatically reconstructing a complete 3D model of a scene from a single RGB image. This challenging task requires inferring the shape of both visible and occluded surfaces. Our approach uses a viewer-centered, multi-layer representation of scene geometry adapted from recent methods for single-object shape completion. To improve the accuracy of viewer-centered representations for complex scenes, we introduce a novel "Epipolar Feature Transformer" that transfers convolutional network features from an input view to other virtual camera viewpoints, and thus better covers the 3D scene geometry. Unlike existing approaches that first detect and localize objects in 3D and then infer object shape using category-specific models, our approach is fully convolutional, end-to-end differentiable, and avoids the resolution and memory limitations of voxel representations. We demonstrate the advantages of multi-layer depth representations and epipolar feature transformers on the reconstruction of a large database of indoor scenes.
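
The idea of transferring features from the input view to a virtual camera can be illustrated with a small geometric sketch. This is not the authors' implementation; it is a minimal NumPy example that assumes pinhole cameras, a known per-pixel depth for the input view, and a known rigid transform between the two cameras. The function name `transfer_features` and all of its parameters are hypothetical and chosen only for illustration. Each input pixel is unprojected to a 3D point, moved into the virtual camera's frame, projected onto the virtual image plane, and its feature vector is averaged into the cell it lands in.

```python
import numpy as np

def transfer_features(feat, depth, K_src, K_dst, T_dst_from_src, out_hw):
    """Scatter per-pixel features from a source view into a virtual view.

    Hypothetical helper for illustration only (assumed pinhole model).
    feat:  (H, W, C) feature map in the source view
    depth: (H, W) depth along the source camera's z-axis
    K_src, K_dst: (3, 3) camera intrinsics
    T_dst_from_src: (4, 4) rigid transform, source frame -> virtual frame
    out_hw: (H_out, W_out) size of the virtual view's feature map
    Returns the averaged feature map and a per-cell hit count.
    """
    H, W, C = feat.shape
    H_out, W_out = out_hw

    # Homogeneous pixel coordinates at pixel centres.
    v, u = np.meshgrid(np.arange(H), np.arange(W), indexing="ij")
    pix = np.stack([u + 0.5, v + 0.5, np.ones((H, W))], axis=-1).reshape(-1, 3)

    # Unproject to 3D points in the source camera frame.
    rays = pix @ np.linalg.inv(K_src).T
    pts_src = rays * depth.reshape(-1, 1)

    # Move the points into the virtual camera frame.
    pts_h = np.concatenate([pts_src, np.ones((pts_src.shape[0], 1))], axis=1)
    pts_dst = (pts_h @ T_dst_from_src.T)[:, :3]

    # Keep points in front of the virtual camera and project them.
    valid = pts_dst[:, 2] > 1e-6
    proj = pts_dst[valid] @ K_dst.T
    uv = proj[:, :2] / proj[:, 2:3]
    col = np.floor(uv[:, 0]).astype(int)
    row = np.floor(uv[:, 1]).astype(int)
    inside = (col >= 0) & (col < W_out) & (row >= 0) & (row < H_out)

    # Average all features that land in the same virtual-view cell.
    flat = row[inside] * W_out + col[inside]
    src_feat = feat.reshape(-1, C)[valid][inside]
    out = np.zeros((H_out * W_out, C))
    count = np.zeros(H_out * W_out)
    np.add.at(out, flat, src_feat)
    np.add.at(count, flat, 1)
    hit = count > 0
    out[hit] /= count[hit, None]
    return out.reshape(H_out, W_out, C), count.reshape(H_out, W_out)
```

Cells of the virtual view that receive no feature stay at zero, and the returned count map makes those holes explicit. The hard scatter used here is not differentiable with respect to depth, so an end-to-end trainable version would need a soft or interpolated assignment instead.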

research
04/17/2018

Pixels, voxels, and views: A study of shape representations for single view 3D object shape prediction

The goal of this paper is to compare surface-based and volumetric 3D obj...
research
08/26/2019

Object-Driven Multi-Layer Scene Decomposition From a Single Image

We present a method that tackles the challenge of predicting color and d...
research
05/19/2019

Geometric Pose Affordance: 3D Human Pose with Scene Constraints

Full 3D estimation of human pose from a single image remains a challengi...
research
07/05/2021

TransformerFusion: Monocular RGB Scene Reconstruction using Transformers

We introduce TransformerFusion, a transformer-based 3D scene reconstruct...
research
09/13/2022

Multiple View Performers for Shape Completion

We propose the Multiple View Performer (MVP) - a new architecture for 3D...
research
01/13/2022

Stereo Magnification with Multi-Layer Images

Representing scenes with multiple semi-transparent colored layers has be...
research
03/24/2022

RayTran: 3D pose estimation and shape reconstruction of multiple objects from videos with ray-traced transformers

We propose a transformer-based neural network architecture for multi-obj...