Atlas: End-to-End 3D Scene Reconstruction from Posed Images

03/23/2020
by   Zak Murez, et al.

We present an end-to-end 3D reconstruction method for a scene that directly regresses a truncated signed distance function (TSDF) from a set of posed RGB images. Traditional approaches to 3D reconstruction rely on an intermediate representation of depth maps prior to estimating a full 3D model of a scene. We hypothesize that a direct regression to 3D is more effective. A 2D CNN extracts features from each image independently, which are then back-projected and accumulated into a voxel volume using the camera intrinsics and extrinsics. After accumulation, a 3D CNN refines the accumulated features and predicts the TSDF values. Additionally, semantic segmentation of the 3D model is obtained without significant additional computation. This approach is evaluated on the ScanNet dataset, where we significantly outperform state-of-the-art baselines (deep multiview stereo followed by traditional TSDF fusion) both quantitatively and qualitatively. We compare our 3D semantic segmentation to prior methods that use a depth sensor, since no previous work attempts the problem with only RGB input.
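
The back-projection and accumulation step described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names (`backproject_features`, `accumulate_views`), the nearest-pixel lookup, and the plain per-voxel average over views are assumptions chosen for brevity. The 2D CNN and the 3D CNN are omitted; `features` stands in for the output of the 2D network.

```python
import numpy as np

def backproject_features(features, K, world_to_cam, origin, voxel_size, dims):
    """Copy per-pixel features into the voxels whose centers project onto them.

    features:     (C, H, W) feature map for one image (stand-in for 2D CNN output)
    K:            (3, 3) camera intrinsics
    world_to_cam: (4, 4) camera extrinsics
    origin:       (3,) world position of the volume's minimum corner
    Returns a (C, X, Y, Z) volume and an (X, Y, Z) boolean validity mask.
    """
    C, H, W = features.shape
    nx, ny, nz = dims
    ii, jj, kk = np.meshgrid(np.arange(nx), np.arange(ny), np.arange(nz), indexing="ij")
    idx = np.stack([ii, jj, kk], axis=-1).reshape(-1, 3)
    world = origin + (idx + 0.5) * voxel_size            # voxel centers, (N, 3)
    world_h = np.concatenate([world, np.ones((len(world), 1))], axis=1)
    cam = (world_to_cam @ world_h.T)[:3]                  # camera coordinates, (3, N)
    z = cam[2]
    in_front = z > 1e-6                                   # discard voxels behind the camera
    safe_z = np.where(in_front, z, 1.0)                   # avoid dividing by zero or negatives
    pix = K @ cam
    u = np.round(pix[0] / safe_z).astype(int)             # nearest pixel column (assumption)
    v = np.round(pix[1] / safe_z).astype(int)             # nearest pixel row (assumption)
    valid = in_front & (u >= 0) & (u < W) & (v >= 0) & (v < H)
    volume = np.zeros((C, len(world)), dtype=features.dtype)
    volume[:, valid] = features[:, v[valid], u[valid]]
    return volume.reshape(C, nx, ny, nz), valid.reshape(nx, ny, nz)

def accumulate_views(views, origin, voxel_size, dims):
    """Average back-projected features over all views that see each voxel.

    views: non-empty list of (features, K, world_to_cam) tuples.
    The plain average is an assumption of this sketch.
    """
    if not views:
        raise ValueError("at least one view is required")
    total, count = None, np.zeros(dims, dtype=np.int64)
    for features, K, world_to_cam in views:
        vol, valid = backproject_features(features, K, world_to_cam, origin, voxel_size, dims)
        total = vol if total is None else total + vol
        count += valid
    return total / np.maximum(count, 1), count
```

In the pipeline the abstract describes, the accumulated volume would then be passed to a 3D CNN that predicts TSDF values; voxels with a count of zero were not observed by any view.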

research
08/08/2019

EdgeNet: Semantic Scene Completion from RGB-D images

Semantic scene completion is the task of predicting a complete 3D repres...
research
03/31/2020

Attention-based Multi-modal Fusion Network for Semantic Scene Completion

This paper presents an end-to-end 3D convolutional network named attenti...
research
04/10/2022

Scale Invariant Semantic Segmentation with RGB-D Fusion

In this paper, we propose a neural network architecture for scale-invari...
research
08/19/2021

VolumeFusion: Deep Depth Fusion for 3D Scene Reconstruction

To reconstruct a 3D scene from a set of calibrated views, traditional mu...
research
06/17/2020

Evaluation of 3D CNN Semantic Mapping for Rover Navigation

Terrain assessment is a key aspect for autonomous exploration rovers, su...
research
04/04/2017

Deep Depth From Focus

Depth from Focus (DFF) is one of the classical ill-posed inverse problem...
research
09/30/2017

Dense RGB-D semantic mapping with Pixel-Voxel neural network

For intelligent robotics applications, extending 3D mapping to 3D semant...
