iDisc: Internal Discretization for Monocular Depth Estimation

04/13/2023
by   Luigi Piccinelli, et al.
0

Monocular depth estimation is fundamental for 3D scene understanding and downstream applications. However, even under the supervised setup, it is still challenging and ill-posed due to the lack of full geometric constraints. Although a scene can consist of millions of pixels, there are fewer high-level patterns. We propose iDisc to learn those patterns with internal discretized representations. The method implicitly partitions the scene into a set of high-level patterns. In particular, our new module, Internal Discretization (ID), implements a continuous-discrete-continuous bottleneck to learn those concepts without supervision. In contrast to state-of-the-art methods, the proposed model does not enforce any explicit constraints or priors on the depth output. The whole network with the ID module can be trained end-to-end, thanks to the bottleneck module based on attention. Our method sets the new state of the art with significant improvements on NYU-Depth v2 and KITTI, outperforming all published methods on the official KITTI benchmark. iDisc can also achieve state-of-the-art results on surface normal estimation. Further, we explore the model generalization capability via zero-shot testing. We observe the compelling need to promote diversification in the outdoor scenario. Hence, we introduce splits of two autonomous driving datasets, DDAD and Argoverse. Code is available at http://vis.xyz/pub/idisc .

READ FULL TEXT

page 5

page 8

page 12

page 14

page 15

page 16

page 17

page 18

research
06/16/2021

EdgeConv with Attention Module for Monocular Depth Estimation

Monocular depth estimation is an especially important task in robotics a...
research
04/05/2022

P3Depth: Monocular Depth Estimation with a Piecewise Planarity Prior

Monocular depth estimation is vital for scene understanding and downstre...
research
06/29/2023

Towards Zero-Shot Scale-Aware Monocular Depth Estimation

Monocular depth estimation is scale-ambiguous, and thus requires scale s...
research
07/14/2021

MSFNet:Multi-scale features network for monocular depth estimation

In recent years, monocular depth estimation is applied to understand the...
research
03/14/2023

A Simple Baseline for Supervised Surround-view Depth Estimation

Depth estimation has been widely studied and serves as the fundamental s...
research
07/28/2022

Depth Field Networks for Generalizable Multi-view Scene Representation

Modern 3D computer vision leverages learning to boost geometric reasonin...
research
07/10/2021

A Weakly-Supervised Depth Estimation Network Using Attention Mechanism

Monocular depth estimation (MDE) is a fundamental task in many applicati...

Please sign up or login with your details

Forgot password? Click here to reset