StereoScene: BEV-Assisted Stereo Matching Empowers 3D Semantic Scene Completion

03/24/2023
by   Bohan Li, et al.
0

3D semantic scene completion (SSC) is an ill-posed task that requires inferring a dense 3D scene from incomplete observations. Previous methods either explicitly incorporate 3D geometric input or rely on learnt 3D prior behind monocular RGB images. However, 3D sensors such as LiDAR are expensive and intrusive while monocular cameras face challenges in modeling precise geometry due to the inherent ambiguity. In this work, we propose StereoScene for 3D Semantic Scene Completion (SSC), which explores taking full advantage of light-weight camera inputs without resorting to any external 3D sensors. Our key insight is to leverage stereo matching to resolve geometric ambiguity. To improve its robustness in unmatched areas, we introduce bird's-eye-view (BEV) representation to inspire hallucination ability with rich context information. On top of the stereo and BEV representations, a mutual interactive aggregation (MIA) module is carefully devised to fully unleash their power. Specifically, a Bi-directional Interaction Transformer (BIT) augmented with confidence re-weighting is used to encourage reliable prediction through mutual guidance while a Dual Volume Aggregation (DVA) module is designed to facilitate complementary aggregation. Experimental results on SemanticKITTI demonstrate that the proposed StereoScene outperforms the state-of-the-art camera-based methods by a large margin with a relative improvement of 26.9 38.6

READ FULL TEXT

page 1

page 4

page 8

research
08/18/2020

DeepLiDARFlow: A Deep Learning Architecture For Scene Flow Estimation Using Monocular Camera and Sparse LiDAR

Scene flow is the dense 3D reconstruction of motion and geometry of a sc...
research
02/27/2023

OccDepth: A Depth-Aware Method for 3D Semantic Scene Completion

3D Semantic Scene Completion (SSC) can provide dense geometric and seman...
research
12/01/2021

MonoScene: Monocular 3D Semantic Scene Completion

MonoScene proposes a 3D Semantic Scene Completion (SSC) framework, where...
research
01/31/2023

Monocular Scene Reconstruction with 3D SDF Transformers

Monocular scene reconstruction from posed images is challenging due to t...
research
03/20/2023

Real-time Semantic Scene Completion Via Feature Aggregation and Conditioned Prediction

Semantic Scene Completion (SSC) aims to simultaneously predict the volum...
research
04/10/2019

DAVANet: Stereo Deblurring with View Aggregation

Nowadays stereo cameras are more commonly adopted in emerging devices su...

Please sign up or login with your details

Forgot password? Click here to reset