BinsFormer: Revisiting Adaptive Bins for Monocular Depth Estimation

04/03/2022
by   Zhenyu Li, et al.
1

Monocular depth estimation is a fundamental task in computer vision and has drawn increasing attention. Recently, some methods reformulate it as a classification-regression task to boost the model performance, where continuous depth is estimated via a linear combination of predicted probability distributions and discrete bins. In this paper, we present a novel framework called BinsFormer, tailored for the classification-regression-based depth estimation. It mainly focuses on two crucial components in the specific task: 1) proper generation of adaptive bins and 2) sufficient interaction between probability distribution and bins predictions. To specify, we employ the Transformer decoder to generate bins, novelly viewing it as a direct set-to-set prediction problem. We further integrate a multi-scale decoder structure to achieve a comprehensive understanding of spatial geometry information and estimate depth maps in a coarse-to-fine manner. Moreover, an extra scene understanding query is proposed to improve the estimation accuracy, which turns out that models can implicitly learn useful information from an auxiliary environment classification task. Extensive experiments on the KITTI, NYU, and SUN RGB-D datasets demonstrate that BinsFormer surpasses state-of-the-art monocular depth estimation methods with prominent margins. Code and pretrained models will be made publicly available at <https://github.com/zhyever/Monocular-Depth-Estimation-Toolbox>.

READ FULL TEXT

page 12

page 13

page 14

page 19

page 20

page 21

research
03/27/2022

DepthFormer: Exploiting Long-Range Correlation and Local Information for Accurate Monocular Depth Estimation

This paper aims to address the problem of supervised monocular depth est...
research
07/10/2022

Depthformer : Multiscale Vision Transformer For Monocular Depth Estimation With Local Global Information Fusion

Attention-based models such as transformers have shown outstanding perfo...
research
08/03/2022

Gradient-based Uncertainty for Monocular Depth Estimation

In monocular depth estimation, disturbances in the image context, like m...
research
08/03/2022

Neural Contourlet Network for Monocular 360 Depth Estimation

For a monocular 360 image, depth estimation is a challenging because the...
research
11/30/2022

ObjCAViT: Improving Monocular Depth Estimation Using Natural Language Models And Image-Object Cross-Attention

While monocular depth estimation (MDE) is an important problem in comput...
research
09/02/2022

LiteDepth: Digging into Fast and Accurate Depth Estimation on Mobile Devices

Monocular depth estimation is an essential task in the computer vision c...
research
05/20/2021

M4Depth: A motion-based approach for monocular depth estimation on video sequences

Getting the distance to objects is crucial for autonomous vehicles. In i...

Please sign up or login with your details

Forgot password? Click here to reset