DeepAI
Log In Sign Up

DF-VO: What Should Be Learnt for Visual Odometry?

03/01/2021
by   Huangying Zhan, et al.
18

Multi-view geometry-based methods dominate the last few decades in monocular Visual Odometry for their superior performance, while they have been vulnerable to dynamic and low-texture scenes. More importantly, monocular methods suffer from scale-drift issue, i.e., errors accumulate over time. Recent studies show that deep neural networks can learn scene depths and relative camera in a self-supervised manner without acquiring ground truth labels. More surprisingly, they show that the well-trained networks enable scale-consistent predictions over long videos, while the accuracy is still inferior to traditional methods because of ignoring geometric information. Building on top of recent progress in computer vision, we design a simple yet robust VO system by integrating multi-view geometry and deep learning on Depth and optical Flow, namely DF-VO. In this work, a) we propose a method to carefully sample high-quality correspondences from deep flows and recover accurate camera poses with a geometric module; b) we address the scale-drift issue by aligning geometrically triangulated depths to the scale-consistent deep depths, where the dynamic scenes are taken into account. Comprehensive ablation studies show the effectiveness of the proposed method, and extensive evaluation results show the state-of-the-art performance of our system, e.g., Ours (1.652%) v.s. ORB-SLAM (3.247%}) in terms of translation error in KITTI Odometry benchmark. Source code is publicly available at: \href{https://github.com/Huangying-Zhan/DF-VO}{DF-VO}.

READ FULL TEXT

page 1

page 4

page 6

09/21/2019

Visual Odometry Revisited: What Should Be Learnt?

In this work we present a monocular visual odometry (VO) algorithm which...
04/03/2020

Towards Better Generalization: Joint Depth-Pose Learning without PoseNet

In this work, we tackle the essential problem of scale inconsistency for...
12/11/2019

Training Deep SLAM on Single Frames

Learning-based visual odometry and SLAM methods demonstrate a steady imp...
03/18/2021

Deep Online Correction for Monocular Visual Odometry

In this work, we propose a novel deep online correction (DOC) framework ...
07/25/2020

Crowdsourced 3D Mapping: A Combined Multi-View Geometry and Self-Supervised Learning Approach

The ability to efficiently utilize crowdsourced visual data carries imme...
09/21/2021

Scale-aware direct monocular odometry

We present a framework for direct monocular odometry based on depth pred...
08/08/2022

Deep Patch Visual Odometry

We propose Deep Patch Visual Odometry (DPVO), a new deep learning system...

Code Repositories

DF-VO

Depth and Flow for Visual Odometry


view repo

toposlam

Topological SLAM: Deep Visual Odometry with Long Term Place Recognition (Loop Closure Detection)


view repo