Cross-Dimensional Refined Learning for Real-Time 3D Visual Perception from Monocular Video

03/16/2023
by   Ziyang Hong, et al.
0

We present a novel real-time capable learning method that jointly perceives a 3D scene's geometry structure and semantic labels. Recent approaches to real-time 3D scene reconstruction mostly adopt a volumetric scheme, where a truncated signed distance function (TSDF) is directly regressed. However, these volumetric approaches tend to focus on the global coherence of their reconstructions, which leads to a lack of local geometrical detail. To overcome this issue, we propose to leverage the latent geometrical prior knowledge in 2D image features by explicit depth prediction and anchored feature generation, to refine the occupancy learning in TSDF volume. Besides, we find that this cross-dimensional feature refinement methodology can also be adopted for the semantic segmentation task. Hence, we proposed an end-to-end cross-dimensional refinement neural network (CDRNet) to extract both 3D mesh and 3D semantic labeling in real time. The experiment results show that the proposed method achieves state-of-the-art 3D perception efficiency on multiple datasets, which indicates the great potential of our method for industrial applications.

READ FULL TEXT

page 1

page 3

page 5

page 6

page 7

page 8

research
08/08/2019

EdgeNet: Semantic Scene Completion from RGB-D images

Semantic scene completion is the task of predicting a complete 3D repres...
research
08/11/2021

A Real-Time Online Learning Framework for Joint 3D Reconstruction and Semantic Segmentation of Indoor Scenes

This paper presents a real-time online vision framework to jointly recov...
research
03/20/2023

Real-time Semantic Scene Completion Via Feature Aggregation and Conditioned Prediction

Semantic Scene Completion (SSC) aims to simultaneously predict the volum...
research
10/26/2020

SCFusion: Real-time Incremental Scene Reconstruction with Semantic Completion

Real-time scene reconstruction from depth data inevitably suffers from o...
research
07/24/2019

SDNet: Semantically Guided Depth Estimation Network

Autonomous vehicles and robots require a full scene understanding of the...
research
03/08/2023

FastSurf: Fast Neural RGB-D Surface Reconstruction using Per-Frame Intrinsic Refinement and TSDF Fusion Prior Learning

We introduce FastSurf, an accelerated neural radiance field (NeRF) frame...
research
06/15/2022

PlanarRecon: Real-time 3D Plane Detection and Reconstruction from Posed Monocular Videos

We present PlanarRecon – a novel framework for globally coherent detecti...

Please sign up or login with your details

Forgot password? Click here to reset