Depth from Videos in the Wild: Unsupervised Monocular Depth Learning from Unknown Cameras

04/10/2019
by Ariel Gordon, et al.

We present a novel method for simultaneously learning depth, egomotion, object motion, and camera intrinsics from monocular videos, using only consistency across neighboring video frames as the supervision signal. Like prior work, our method learns by applying differentiable warping to frames and comparing the result to adjacent ones, but it provides several improvements: we address occlusions geometrically and differentiably, directly using the depth maps as predicted during training; we introduce randomized layer normalization, a novel and powerful regularizer; and we account for object motion relative to the scene. To the best of our knowledge, our work is the first to learn the camera intrinsic parameters, including lens distortion, from video in an unsupervised manner, thereby allowing us to extract accurate depth and motion from arbitrary videos of unknown origin at scale. We evaluate our results on the Cityscapes, KITTI and EuRoC datasets, establishing a new state of the art on depth prediction and odometry, and demonstrate qualitatively that depth prediction can be learned from a collection of YouTube videos.
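The warping step mentioned above can be illustrated with the standard pinhole reprojection used across this line of work. The following is a minimal NumPy sketch, not the authors' implementation: the function name `reproject_pixels`, its argument layout, and the omission of lens distortion and object motion are all assumptions made for illustration. Given a depth map, an intrinsics matrix K, and a camera motion (R, t), it computes where each pixel lands in the adjacent frame.

```python
import numpy as np


def reproject_pixels(depth, K, R, t):
    """Map every pixel of one frame into an adjacent frame (hypothetical helper).

    depth: (H, W) per-pixel depth, K: (3, 3) pinhole intrinsics,
    R: (3, 3) rotation and t: (3,) translation between the two camera poses.
    Returns (H, W, 2) pixel coordinates and (H, W) depth in the adjacent frame.
    Lens distortion and per-object motion are deliberately left out.
    """
    h, w = depth.shape
    u, v = np.meshgrid(np.arange(w), np.arange(h))
    # Homogeneous pixel coordinates (u, v, 1) for the whole grid.
    pixels = np.stack([u, v, np.ones_like(u)], axis=-1).astype(np.float64)
    # Back-project to 3D: X = depth * K^-1 * p.
    points = (pixels @ np.linalg.inv(K).T) * depth[..., None]
    # Apply the camera motion, then project with the same intrinsics.
    projected = (points @ R.T + t) @ K.T
    new_depth = projected[..., 2]
    coords = projected[..., :2] / new_depth[..., None]
    return coords, new_depth


# Example: a fronto-parallel plane at depth 2 and a sideways shift of 0.1.
K = np.array([[100.0, 0.0, 2.0], [0.0, 100.0, 1.5], [0.0, 0.0, 1.0]])
depth = np.full((3, 4), 2.0)
coords, new_depth = reproject_pixels(depth, K, np.eye(3), np.array([0.1, 0.0, 0.0]))
```

In a training setup, the adjacent frame would be sampled bilinearly at `coords` and compared against the original frame, and the projected depth could serve the kind of geometric occlusion reasoning the abstract describes. That would require an automatic-differentiation framework rather than plain NumPy.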

research
02/15/2018

Unsupervised Learning of Depth and Ego-Motion from Monocular Video Using 3D Geometric Constraints

We present a novel approach for unsupervised learning of depth and ego-m...
research
06/14/2022

3D scene reconstruction from monocular spherical video with motion parallax

In this paper, we describe a method to capture nearly entirely spherical...
research
08/28/2019

Unsupervised Scale-consistent Depth and Ego-motion Learning from Monocular Video

Recent work has shown that CNN-based depth and ego-motion estimators can...
research
03/09/2019

Sparse Representations for Object and Ego-motion Estimation in Dynamic Scenes

Dynamic scenes that contain both object motion and egomotion are a chall...
research
03/17/2023

MoRF: Mobile Realistic Fullbody Avatars from a Monocular Video

We present a new approach for learning Mobile Realistic Fullbody (MoRF) ...
research
05/25/2021

Unsupervised Scale-consistent Depth Learning from Video

We propose a monocular depth estimator SC-Depth, which requires only unl...
research
11/15/2018

Depth Prediction Without the Sensors: Leveraging Structure for Unsupervised Learning from Monocular Videos

Learning to predict scene depth from RGB inputs is a challenging task bo...
