DS-Depth: Dynamic and Static Depth Estimation via a Fusion Cost Volume

08/14/2023
by   Xingyu Miao, et al.
0

Self-supervised monocular depth estimation methods typically rely on the reprojection error to capture geometric relationships between successive frames in static environments. However, this assumption does not hold in dynamic objects in scenarios, leading to errors during the view synthesis stage, such as feature mismatch and occlusion, which can significantly reduce the accuracy of the generated depth maps. To address this problem, we propose a novel dynamic cost volume that exploits residual optical flow to describe moving objects, improving incorrectly occluded regions in static cost volumes used in previous work. Nevertheless, the dynamic cost volume inevitably generates extra occlusions and noise, thus we alleviate this by designing a fusion module that makes static and dynamic cost volumes compensate for each other. In other words, occlusion from the static volume is refined by the dynamic volume, and incorrect information from the dynamic volume is eliminated by the static volume. Furthermore, we propose a pyramid distillation loss to reduce photometric error inaccuracy at low resolutions and an adaptive photometric error loss to alleviate the flow direction of the large gradient in the occlusion regions. We conducted extensive experiments on the KITTI and Cityscapes datasets, and the results demonstrate that our model outperforms previously published baselines for self-supervised monocular depth estimation.

READ FULL TEXT

page 1

page 3

page 4

page 8

page 9

page 10

page 11

research
06/17/2020

Self-Supervised Joint Learning Framework of Depth Estimation via Implicit Cues

In self-supervised monocular depth estimation, the depth discontinuity a...
research
04/29/2021

The Temporal Opportunist: Self-Supervised Multi-Frame Monocular Depth

Self-supervised monocular depth estimation networks are trained to predi...
research
08/29/2019

Improving Self-Supervised Single View Depth Estimation by Masking Occlusion

Single view depth estimation models can be trained from video footage us...
research
08/09/2021

Self-supervised Learning of Occlusion Aware Flow Guided 3D Geometry Perception with Adaptive Cross Weighted Loss from Monocular Videos

Self-supervised deep learning-based 3D scene understanding methods can o...
research
11/24/2020

MonoRec: Semi-Supervised Dense Reconstruction in Dynamic Environments from a Single Moving Camera

In this paper, we propose MonoRec, a semi-supervised monocular dense rec...
research
12/12/2022

CbwLoss: Constrained Bidirectional Weighted Loss for Self-supervised Learning of Depth and Pose

Photometric differences are widely used as supervision signals to train ...
research
05/20/2022

Self-Supervised Depth Estimation with Isometric-Self-Sample-Based Learning

Managing the dynamic regions in the photometric loss formulation has bee...

Please sign up or login with your details

Forgot password? Click here to reset