DevNet: Self-supervised Monocular Depth Learning via Density Volume Construction

09/14/2022
by Kaichen Zhou, et al.

Self-supervised depth learning from monocular images normally relies on the 2D pixel-wise photometric relation between temporally adjacent image frames. However, such methods neither fully exploit the 3D point-wise geometric correspondences nor effectively tackle the ambiguities in photometric warping caused by occlusions or illumination inconsistency. To address these problems, this work proposes the Density Volume Construction Network (DevNet), a novel self-supervised monocular depth learning framework that considers 3D spatial information and exploits stronger geometric constraints among adjacent camera frustums. Instead of directly regressing a per-pixel value from a single image, DevNet divides the camera frustum into multiple parallel planes and predicts the point-wise occlusion probability density on each plane. The final depth map is generated by integrating the density along the corresponding rays. During training, novel regularization strategies and loss functions are introduced to mitigate photometric ambiguities and overfitting. Without noticeably increasing model parameter count or running time, DevNet outperforms several representative baselines on both the KITTI-2015 outdoor dataset and the NYU-V2 indoor dataset. In particular, the root-mean-square deviation is reduced by around 4% on both datasets in the task of depth estimation. Code is available at https://github.com/gitkaichenzhou/DevNet.
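To make the idea of integrating density along rays concrete, here is a minimal NumPy sketch. It is not taken from the DevNet code. It assumes a standard volume-rendering quadrature: density on each plane is converted to an occlusion probability, multiplied by the probability that the ray reached that plane, and used to weight the plane depths. The function name `depth_from_density`, the array layout, and the handling of the last plane interval are all illustrative assumptions, and the paper's exact formulation may differ.

```python
import numpy as np

def depth_from_density(sigma, plane_depths):
    """Hypothetical helper: expected ray-termination depth from a density volume.

    sigma:        (D, H, W) non-negative densities, one slice per frustum plane.
    plane_depths: (D,) strictly increasing plane depths, D >= 2.
    Returns (depth, weights) with shapes (H, W) and (D, H, W).
    """
    sigma = np.asarray(sigma, dtype=np.float64)
    d = np.asarray(plane_depths, dtype=np.float64)

    # Spacing between consecutive planes; the last interval reuses the
    # previous spacing (an assumption made for this sketch).
    delta = np.diff(d, append=d[-1] + (d[-1] - d[-2]))

    # Probability that the ray is occluded within each interval.
    alpha = 1.0 - np.exp(-sigma * delta[:, None, None])

    # Transmittance: probability that the ray reaches plane i unoccluded.
    trans = np.cumprod(1.0 - alpha, axis=0)
    trans = np.concatenate([np.ones_like(trans[:1]), trans[:-1]], axis=0)

    # Per-plane termination weights and the resulting expected depth.
    weights = trans * alpha
    depth = (weights * d[:, None, None]).sum(axis=0)
    return depth, weights
```

In this sketch, a ray with a single very dense plane receives a depth close to that plane's depth, while a ray with diffuse density receives a weighted average over several planes. The weights are not renormalised, so a ray that never terminates within the frustum yields a depth that is biased towards zero.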

