Unsupervised Learning of Depth, Camera Pose and Optical Flow from Monocular Video

05/19/2022
by   Dipan Mandal, et al.
0

We propose DFPNet – an unsupervised, joint learning system for monocular Depth, Optical Flow and egomotion (Camera Pose) estimation from monocular image sequences. Due to the nature of 3D scene geometry these three components are coupled. We leverage this fact to jointly train all the three components in an end-to-end manner. A single composite loss function – which involves image reconstruction-based loss for depth optical flow, bidirectional consistency checks and smoothness loss components – is used to train the network. Using hyperparameter tuning, we are able to reduce the model size to less than 5 (8.4M parameters) of state-of-the-art DFP models. Evaluation on KITTI and Cityscapes driving datasets reveals that our model achieves results comparable to state-of-the-art in all of the three tasks, even with the significantly smaller model size.

READ FULL TEXT
research
03/06/2018

GeoNet: Unsupervised Learning of Dense Depth, Optical Flow and Camera Pose

We propose GeoNet, a jointly unsupervised learning framework for monocul...
research
12/02/2021

Dimensions of Motion: Learning to Predict a Subspace of Optical Flow from a Single Image

We introduce the problem of predicting, from a single video frame, a low...
research
03/25/2019

Learning Monocular Visual Odometry through Geometry-Aware Curriculum Learning

Inspired by the cognitive process of humans and animals, Curriculum Lear...
research
07/16/2019

Speed estimation evaluation on the KITTI benchmark based on motion and monocular depth information

In this technical report we investigate speed estimation of the ego-vehi...
research
06/08/2020

What Matters in Unsupervised Optical Flow

We systematically compare and analyze a set of key components in unsuper...
research
12/20/2018

Robustness Meets Deep Learning: An End-to-End Hybrid Pipeline for Unsupervised Learning of Egomotion

In this work, we propose a method that combines unsupervised deep learni...
research
10/21/2020

MonoComb: A Sparse-to-Dense Combination Approach for Monocular Scene Flow

Contrary to the ongoing trend in automotive applications towards usage o...

Please sign up or login with your details

Forgot password? Click here to reset