Self-supervised Learning with Geometric Constraints in Monocular Video: Connecting Flow, Depth, and Camera

07/12/2019
by   Yuhua Chen, et al.
1

We present GLNet, a self-supervised framework for learning depth, optical flow, camera pose and intrinsic parameters from monocular video -- addressing the difficulty of acquiring realistic ground-truth for such tasks. We propose three contributions: 1) we design new loss functions that capture multiple geometric constraints (eg. epipolar geometry) as well as adaptive photometric costs that support multiple moving objects, rigid and non-rigid, 2) we extend the model such that it predicts camera intrinsics, making it applicable to uncalibrated video, and 3) we propose several online finetuning strategies that rely on the symmetry of our self-supervised loss in both training and testing, in particular optimizing model parameters and/or the output of different tasks, leveraging their mutual interactions. The idea of jointly optimizing the system output, under all geometric and photometric constraints can be viewed as a dense generalization of classical bundle adjustment. We demonstrate the effectiveness of our method on KITTI and Cityscapes, where we outperform previous self-supervised approaches on multiple tasks. We also show good generalization for transfer learning.

READ FULL TEXT

page 3

page 7

page 8

research
08/09/2021

Self-supervised Learning of Occlusion Aware Flow Guided 3D Geometry Perception with Adaptive Cross Weighted Loss from Monocular Videos

Self-supervised deep learning-based 3D scene understanding methods can o...
research
09/14/2022

DevNet: Self-supervised Monocular Depth Learning via Density Volume Construction

Self-supervised depth learning from monocular images normally relies on ...
research
05/30/2021

Unsupervised Joint Learning of Depth, Optical Flow, Ego-motion from Video

Estimating geometric elements such as depth, camera motion, and optical ...
research
05/07/2020

Self-Supervised Human Depth Estimation from Monocular Videos

Previous methods on estimating detailed human depth often require superv...
research
08/04/2021

Self-Supervised Learning of Depth and Ego-Motion from Video by Alternative Training and Geometric Constraints from 3D to 2D

Self-supervised learning of depth and ego-motion from unlabeled monocula...
research
05/03/2022

GeoRefine: Self-Supervised Online Depth Refinement for Accurate Dense Mapping

We present a robust and accurate depth refinement system, named GeoRefin...
research
12/04/2017

Self-supervised Learning of Motion Capture

Current state-of-the-art solutions for motion capture from a single came...

Please sign up or login with your details

Forgot password? Click here to reset