Unsupervised Depth Learning in Challenging Indoor Video: Weak Rectification to Rescue

06/04/2020
by   Jia-Wang Bian, et al.
0

Single-view depth estimation using CNNs trained from unlabelled videos has shown significant promise. However, the excellent results have mostly been obtained in street-scene driving scenarios, and such methods often fail in other settings, particularly indoor videos taken by handheld devices, in which case the ego-motion is often degenerate, i.e., the rotation dominates the translation. In this work, we establish that the degenerate camera motions exhibited in handheld settings are a critical obstacle for unsupervised depth learning. A main contribution of our work is fundamental analysis which shows that the rotation behaves as noise during training, as opposed to the translation (baseline) which provides supervision signals. To capitalise on our findings, we propose a novel data pre-processing method for effective training, i.e., we search for image pairs with modest translation and remove their rotation via the proposed weak image rectification. With our pre-processing, existing unsupervised models can be trained well in challenging scenarios (e.g., NYUv2 dataset), and the results outperform the unsupervised SOTA by a large margin (0.147 vs. 0.189 in the AbsRel error).

READ FULL TEXT

page 8

page 15

page 16

research
11/15/2018

Depth Prediction Without the Sensors: Leveraging Structure for Unsupervised Learning from Monocular Videos

Learning to predict scene depth from RGB inputs is a challenging task bo...
research
01/17/2017

Computing Egomotion with Local Loop Closures for Egocentric Videos

Finding the camera pose is an important step in many egocentric video ap...
research
05/05/2021

Moving SLAM: Fully Unsupervised Deep Learning in Non-Rigid Scenes

We propose a method to train deep networks to decompose videos into 3D g...
research
06/08/2020

Semantics-Driven Unsupervised Learning for Monocular Depth and Ego-Motion Estimation

We propose a semantics-driven unsupervised learning approach for monocul...
research
06/27/2021

Indoor Panorama Planar 3D Reconstruction via Divide and Conquer

Indoor panorama typically consists of human-made structures parallel or ...
research
10/20/2019

Moving Indoor: Unsupervised Video Depth Learning in Challenging Environments

Recently unsupervised learning of depth from videos has made remarkable ...

Please sign up or login with your details

Forgot password? Click here to reset