DeepAI AI Chat
Log In Sign Up

Visual Attention-based Self-supervised Absolute Depth Estimation using Geometric Priors in Autonomous Driving

by   Jie Xiang, et al.

Although existing monocular depth estimation methods have made great progress, predicting an accurate absolute depth map from a single image is still challenging due to the limited modeling capacity of networks and the scale ambiguity issue. In this paper, we introduce a fully Visual Attention-based Depth (VADepth) network, where spatial attention and channel attention are applied to all stages. By continuously extracting the dependencies of features along the spatial and channel dimensions over a long distance, VADepth network can effectively preserve important details and suppress interfering features to better perceive the scene structure for more accurate depth estimates. In addition, we utilize geometric priors to form scale constraints for scale-aware model training. Specifically, we construct a novel scale-aware loss using the distance between the camera and a plane fitted by the ground points corresponding to the pixels of the rectangular area in the bottom middle of the image. Experimental results on the KITTI dataset show that this architecture achieves the state-of-the-art performance and our method can directly output absolute depth without post-processing. Moreover, our experiments on the SeasonDepth dataset also demonstrate the robustness of our model to multiple unseen environments.


page 1

page 6

page 7


Channel-Wise Attention-Based Network for Self-Supervised Monocular Depth Estimation

Self-supervised learning has shown very promising results for monocular ...

Depth Monocular Estimation with Attention-based Encoder-Decoder Network from Single Image

Depth information is the foundation of perception, essential for autonom...

PackNet-SfM: 3D Packing for Self-Supervised Monocular Depth Estimation

Densely estimating the depth of a scene from a single image is an ill-po...

Detaching and Boosting: Dual Engine for Scale-Invariant Self-Supervised Monocular Depth Estimation

Monocular depth estimation (MDE) in the self-supervised scenario has eme...

Deep Planar Parallax for Monocular Depth Estimation

Depth estimation is a fundamental problem in the perception system of au...

S&CNet: A Enhanced Coarse-to-fine Framework For Monocular Depth Completion

Real-time depth completing is a critical problem for robotics and autono...