View-volume Network for Semantic Scene Completion from a Single Depth Image

06/14/2018
by   Yu-Xiao Guo, et al.
0

We introduce a View-Volume convolutional neural network (VVNet) for inferring the occupancy and semantic labels of a volumetric 3D scene from a single depth image. The VVNet concatenates a 2D view CNN and a 3D volume CNN with a differentiable projection layer. Given a single RGBD image, our method extracts the detailed geometric features from the input depth image with a 2D view CNN and then projects the features into a 3D volume according to the input depth map via a projection layer. After that, we learn the 3D context information of the scene with a 3D volume CNN for computing the result volumetric occupancy and semantic labels. With combined 2D and 3D representations, the VVNet efficiently reduces the computational cost, enables feature extraction from multi-channel high resolution inputs, and thus significantly improves the result accuracy. We validate our method and demonstrate its efficiency and effectiveness on both synthetic SUNCG and real NYU dataset.

READ FULL TEXT

page 3

page 7

research
11/28/2016

Semantic Scene Completion from a Single Depth Image

This paper focuses on semantic scene completion, a task for producing a ...
research
07/03/2020

ODE-CNN: Omnidirectional Depth Extension Networks

Omnidirectional 360 camera proliferates rapidly for autonomous robots si...
research
03/10/2019

Deep Reinforcement Learning of Volume-guided Progressive View Inpainting for 3D Point Scene Completion from a Single Depth Image

We present a deep reinforcement learning method of progressive view inpa...
research
04/10/2017

CanvoX: High-resolution VR Painting in Large Volumetric Canvas

With virtual reality, digital painting on 2D canvases is now being exten...
research
09/03/2019

ForkNet: Multi-branch Volumetric Semantic Completion from a Single Depth Image

We propose a novel model for 3D semantic completion from a single depth ...
research
03/17/2023

Semantic Scene Completion with Cleaner Self

Semantic Scene Completion (SSC) transforms an image of single-view depth...
research
05/05/2021

Cuboids Revisited: Learning Robust 3D Shape Fitting to Single RGB Images

Humans perceive and construct the surrounding world as an arrangement of...

Please sign up or login with your details

Forgot password? Click here to reset