3D Hierarchical Refinement and Augmentation for Unsupervised Learning of Depth and Pose from Monocular Video

12/06/2021
by Guangming Wang, et al.

Depth and ego-motion estimation is essential for the localization and navigation of autonomous robots and self-driving vehicles. Recent studies have made it possible to learn per-pixel depth and ego-motion from unlabeled monocular video. This paper proposes a novel unsupervised training framework with 3D hierarchical refinement and augmentation based on explicit 3D geometry. In this framework, depth and pose estimation are hierarchically and mutually coupled so that the estimated pose is refined layer by layer. An intermediate view image is synthesized by warping the pixels of an image with the estimated depth and the coarse pose. The residual pose transformation is then estimated from this new view image and the image of the adjacent frame, and is used to refine the coarse pose. The iterative refinement is implemented in a differentiable manner, so the whole framework can be optimized jointly. In addition, a new image augmentation method is proposed for pose estimation: it synthesizes a new view image, which augments the pose in 3D space while producing a new augmented 2D image. Experiments on KITTI demonstrate that the depth estimation achieves state-of-the-art performance and even surpasses recent approaches that rely on auxiliary tasks. The visual odometry outperforms all recent unsupervised monocular learning-based methods and is competitive with the geometry-based method ORB-SLAM2 with back-end optimization.
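The geometry behind the warping and refinement steps can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the yaw-only rotation, the camera intrinsics, and the convention that the refined pose is the residual pose applied after the coarse pose are all assumptions made for illustration. The sketch reprojects a single pixel using its depth and a pose, then checks that warping with the coarse pose followed by the residual pose lands on the same pixel as warping once with the composed pose.

```python
import math


def matmul(a, b):
    """Multiply two matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]


def pose_matrix(yaw, tx, ty, tz):
    """Hypothetical helper: 4x4 rigid transform with a rotation about the
    camera y-axis (yaw only, for brevity) and a translation."""
    c, s = math.cos(yaw), math.sin(yaw)
    return [[c, 0.0, s, tx],
            [0.0, 1.0, 0.0, ty],
            [-s, 0.0, c, tz],
            [0.0, 0.0, 0.0, 1.0]]


def reproject(u, v, depth, pose, fx, fy, cx, cy):
    """Back-project pixel (u, v) with its depth into 3D, move the point by
    `pose`, and project it into the new view with a pinhole camera model.
    Returns the new pixel coordinates and the new depth."""
    x = (u - cx) / fx * depth
    y = (v - cy) / fy * depth
    moved = matmul(pose, [[x], [y], [depth], [1.0]])
    X, Y, Z = moved[0][0], moved[1][0], moved[2][0]
    return fx * X / Z + cx, fy * Y / Z + cy, Z


# Assumed intrinsics and poses, chosen only to make the example concrete.
FX, FY, CX, CY = 720.0, 720.0, 620.0, 187.0
coarse = pose_matrix(0.02, 0.0, 0.0, -1.0)      # coarse pose estimate
residual = pose_matrix(0.005, 0.01, 0.0, -0.1)  # residual pose estimate

# Assumed convention: the residual acts on the intermediate view, so the
# refined pose is the residual applied after the coarse pose.
refined = matmul(residual, coarse)

# Two-step warp: source view -> intermediate view -> adjacent frame.
u1, v1, z1 = reproject(400.0, 200.0, 10.0, coarse, FX, FY, CX, CY)
two_step = reproject(u1, v1, z1, residual, FX, FY, CX, CY)

# One-step warp with the refined pose.
one_step = reproject(400.0, 200.0, 10.0, refined, FX, FY, CX, CY)
```

Under these assumptions `two_step` and `one_step` agree up to floating-point error, which is why a residual pose estimated on the intermediate view can be folded back into the coarse pose. A full pipeline would apply the same reprojection to every pixel and sample the image differentiably; that part is omitted here.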

research
03/03/2020

DiPE: Deeper into Photometric Errors for Unsupervised Learning of Depth and Ego-motion from Monocular Videos

Unsupervised learning of depth and ego-motion from unlabelled monocular ...
research
09/16/2018

GANVO: Unsupervised Deep Monocular Visual Odometry and Depth Estimation with Generative Adversarial Networks

In the last decade, supervised deep learning approaches have been extens...
research
03/11/2018

Unsupervised Learning of Monocular Depth Estimation and Visual Odometry with Deep Feature Reconstruction

Despite learning based methods showing promising results in single view ...
research
03/24/2022

Unsupervised Simultaneous Learning for Camera Re-Localization and Depth Estimation from Video

We present an unsupervised simultaneous learning framework for the task ...
research
12/30/2019

Video Depth Estimation by Fusing Flow-to-Depth Proposals

We present an approach with a novel differentiable flow-to-depth layer f...
research
12/23/2018

Epipolar Geometry based Learning of Multi-view Depth and Ego-Motion from Monocular Sequences

Deep approaches to predict monocular depth and ego-motion have grown in ...
research
01/20/2022

GeoFill: Reference-Based Image Inpainting of Scenes with Complex Geometry

Reference-guided image inpainting restores image pixels by leveraging th...
