Boosting Monocular Depth Estimation with Sparse Guided Points

02/03/2022
by   Guangkai Xu, et al.
15

Existing monocular depth estimation shows excellent robustness in the wild, but the affine-invariant prediction requires aligning with the ground truth globally while being converted into the metric depth. In this work, we firstly propose a modified locally weighted linear regression strategy to leverage sparse ground truth and generate a flexible depth transformation to correct the coarse misalignment brought by global recovery strategy. Applying this strategy, we achieve significant improvement (more than 50 recent state-of-the-art methods on five zero-shot datasets. Moreover, we train a robust depth estimation model with 6.3 million data and analyze the training process by decoupling the inaccuracy into coarse misalignment inaccuracy and detail missing inaccuracy. As a result, our model based on ResNet50 even outperforms the state-of-the-art DPT ViT-Large model with the help of our recovery strategy. In addition to accuracy, the consistency is also boosted for simple per-frame video depth estimation. Compared with monocular depth estimation, robust video depth estimation, and depth completion methods, our pipeline obtains state-of-the-art performance on video depth estimation without any post-processing. Experiments of 3D scene reconstruction from consistent video depth are conducted for intuitive comparison as well.

READ FULL TEXT

page 3

page 7

page 8

page 13

page 14

page 15

research
03/16/2018

Monocular Fisheye Camera Depth Estimation Using Semi-supervised Sparse Velodyne Data

Near field depth estimation around a self driving car is an important fu...
research
03/21/2023

Monocular Visual-Inertial Depth Estimation

We present a visual-inertial depth estimation pipeline that integrates m...
research
07/02/2019

Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-Shot Cross-Dataset Transfer

The success of monocular depth estimation relies on large and diverse tr...
research
10/18/2022

Hierarchical Normalization for Robust Monocular Depth Estimation

In this paper, we address monocular depth estimation with deep neural ne...
research
06/03/2018

Soccer on Your Tabletop

We present a system that transforms a monocular video of a soccer game i...
research
11/19/2021

ColDE: A Depth Estimation Framework for Colonoscopy Reconstruction

One of the key elements of reconstructing a 3D mesh from a monocular vid...
research
03/04/2021

Learning High Fidelity Depths of Dressed Humans by Watching Social Media Dance Videos

A key challenge of learning the geometry of dressed humans lies in the l...

Please sign up or login with your details

Forgot password? Click here to reset