One scalar is all you need – absolute depth estimation using monocular self-supervision

03/14/2023
by   Alexandra Dana, et al.
0

Self-supervised monocular depth estimators can be trained or fine-tuned on new scenes using only images and no ground-truth depth data, achieving good accuracy. However, these estimators suffer from the inherent ambiguity of the depth scale, significantly limiting their applicability. In this work, we present a method for transferring the depth-scale from existing source datasets collected with ground-truth depths to depth estimators that are trained using self-supervision on a newly collected target dataset consisting of images only, solving a significant limiting factor. We show that self-supervision based on projective geometry results in predicted depths that are linearly correlated with their ground-truth depths. Moreover, the linearity of this relationship also holds when jointly training on images from two different (real or synthetic) source and target domains. We utilize this observed property and model the relationship between the ground-truth and the predicted up-to-scale depths of images from the source domain using a single global scalar. Then, we scale the predicted up-to-scale depths of images from the target domain using the estimated global scaling factor, performing depth-scale transfer between the two domains. This suggested method was evaluated on the target KITTI and DDAD datasets, while using other real or synthetic source datasets, that have a larger field-of-view, other image style or structural content. Our approach achieves competitive accuracy on KITTI, even without using the specially tailored vKITTI or vKITTI2 datasets, and higher accuracy on DDAD, when using both real or synthetic source datasets.

READ FULL TEXT

page 1

page 3

page 11

page 16

page 17

page 18

page 19

research
06/04/2018

Digging Into Self-Supervised Monocular Depth Estimation

Depth-sensing is important for both navigation and scene understanding. ...
research
12/10/2022

Mind The Edge: Refining Depth Edges in Sparsely-Supervised Monocular Depth Estimation

Monocular Depth Estimation (MDE) is a fundamental problem in computer vi...
research
05/06/2019

PackNet-SfM: 3D Packing for Self-Supervised Monocular Depth Estimation

Densely estimating the depth of a scene from a single image is an ill-po...
research
09/16/2020

Calibrating Self-supervised Monocular Depth Estimation

In the recent years, many methods demonstrated the ability of neural net...
research
10/20/2021

Depth360: Monocular Depth Estimation using Learnable Axisymmetric Camera Model for Spherical Camera Image

Self-supervised monocular depth estimation has been widely investigated ...
research
10/08/2022

Detaching and Boosting: Dual Engine for Scale-Invariant Self-Supervised Monocular Depth Estimation

Monocular depth estimation (MDE) in the self-supervised scenario has eme...
research
02/08/2023

SkyEye: Self-Supervised Bird's-Eye-View Semantic Mapping Using Monocular Frontal View Images

Bird's-Eye-View (BEV) semantic maps have become an essential component o...

Please sign up or login with your details

Forgot password? Click here to reset