Global-Local Path Networks for Monocular Depth Estimation with Vertical CutDepth

01/19/2022
by   Doyeon Kim, et al.
3

Depth estimation from a single image is an important task that can be applied to various fields in computer vision, and has grown rapidly with the development of convolutional neural networks. In this paper, we propose a novel structure and training strategy for monocular depth estimation to further improve the prediction accuracy of the network. We deploy a hierarchical transformer encoder to capture and convey the global context, and design a lightweight yet powerful decoder to generate an estimated depth map while considering local connectivity. By constructing connected paths between multi-scale local features and the global decoding stream with our proposed selective feature fusion module, the network can integrate both representations and recover fine details. In addition, the proposed decoder shows better performance than the previously proposed decoders, with considerably less computational complexity. Furthermore, we improve the depth-specific augmentation method by utilizing an important observation in depth estimation to enhance the model. Our network achieves state-of-the-art performance over the challenging depth dataset NYU Depth V2. Extensive experiments have been conducted to validate and show the effectiveness of the proposed approach. Finally, our model shows better generalisation ability and robustness than other comparative models.

READ FULL TEXT

page 2

page 5

page 11

research
09/29/2022

Lightweight Monocular Depth Estimation with an Edge Guided Network

Monocular depth estimation is an important task that can be applied to m...
research
07/13/2019

Structure-Aware Residual Pyramid Network for Monocular Depth Estimation

Monocular depth estimation is an essential task for scene understanding....
research
10/18/2022

Hierarchical Normalization for Robust Monocular Depth Estimation

In this paper, we address monocular depth estimation with deep neural ne...
research
03/03/2022

NeW CRFs: Neural Window Fully-connected CRFs for Monocular Depth Estimation

Estimating the accurate depth from a single image is challenging since i...
research
01/17/2023

SwinDepth: Unsupervised Depth Estimation using Monocular Sequences via Swin Transformer and Densely Cascaded Network

Monocular depth estimation plays a critical role in various computer vis...
research
09/14/2023

Unleashing the Power of Depth and Pose Estimation Neural Networks by Designing Compatible Endoscopic Images

Deep learning models have witnessed depth and pose estimation framework ...
research
04/16/2023

EGformer: Equirectangular Geometry-biased Transformer for 360 Depth Estimation

Estimating the depths of equirectangular (360) images (EIs) is challengi...

Please sign up or login with your details

Forgot password? Click here to reset