Learning Depth via Leveraging Semantics: Self-supervised Monocular Depth Estimation with Both Implicit and Explicit Semantic Guidance

02/11/2021
by   Rui Li, et al.
0

Self-supervised depth estimation has made a great success in learning depth from unlabeled image sequences. While the mappings between image and pixel-wise depth are well-studied in current methods, the correlation between image, depth and scene semantics, however, is less considered. This hinders the network to better understand the real geometry of the scene, since the contextual clues, contribute not only the latent representations of scene depth, but also the straight constraints for depth map. In this paper, we leverage the two benefits by proposing the implicit and explicit semantic guidance for accurate self-supervised depth estimation. We propose a Semantic-aware Spatial Feature Alignment (SSFA) scheme to effectively align implicit semantic features with depth features for scene-aware depth estimation. We also propose a semantic-guided ranking loss to explicitly constrain the estimated depth maps to be consistent with real scene contextual properties. Both semantic label noise and prediction uncertainty is considered to yield reliable depth supervisions. Extensive experimental results show that our method produces high quality depth maps which are consistently superior either on complex scenes or diverse semantic categories, and outperforms the state-of-the-art methods by a significant margin.

READ FULL TEXT

page 1

page 3

page 5

page 7

page 8

research
12/15/2020

Semantic-Guided Representation Enhancement for Self-supervised Monocular Trained Depth Estimation

Self-supervised depth estimation has shown its great effectiveness in pr...
research
08/19/2021

Fine-grained Semantics-aware Representation Enhancement for Self-supervised Monocular Depth Estimation

Self-supervised monocular depth estimation has been widely studied, owin...
research
10/06/2020

SAFENet: Self-Supervised Monocular Depth Estimation with Semantic-Aware Feature Extraction

Self-supervised monocular depth estimation has emerged as a promising me...
research
06/07/2021

Self-supervised Depth Estimation Leveraging Global Perception and Geometric Smoothness Using On-board Videos

Self-supervised depth estimation has drawn much attention in recent year...
research
03/15/2019

SceneCode: Monocular Dense Semantic Reconstruction using Learned Encoded Scene Representations

Systems which incrementally create 3D semantic maps from image sequences...
research
11/08/2021

Residual-Guided Learning Representation for Self-Supervised Monocular Depth Estimation

Photometric consistency loss is one of the representative objective func...
research
03/15/2019

SceneCode: Monocular Dense Semantic Reconstruction using Learned Encoded Scene Representation

Systems which incrementally create 3D semantic maps from image sequences...

Please sign up or login with your details

Forgot password? Click here to reset