Training Deep SLAM on Single Frames

12/11/2019
by Igor Slinko, et al.

Learning-based visual odometry and SLAM methods have improved steadily over the past years. However, collecting ground truth poses to train these methods is difficult and expensive. Training in an unsupervised mode could resolve this, but a large gap remains between the performance of unsupervised and supervised methods. In this work, we focus on generating synthetic data for deep learning-based visual odometry and SLAM methods that take optical flow as input. We produce training data in the form of optical flow that corresponds to arbitrary camera movement between a real frame and a virtual frame. To synthesize the data, we use depth maps either produced by a depth sensor or estimated from a stereo pair. We train a visual odometry model on the synthetic data and do not use ground truth poses, hence this model can be considered unsupervised. It can also be classified as monocular, as we do not use depth maps at inference. We also propose a simple way to convert any visual odometry model into a SLAM method based on frame matching and graph optimization. We demonstrate that both the synthetically trained visual odometry model and the proposed SLAM method built upon this model yield state-of-the-art results among unsupervised methods on the KITTI dataset and show promising results on the challenging EuRoC dataset.
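
To make the data synthesis step concrete, here is a minimal sketch of how optical flow between a real frame and a virtual frame could be computed from a depth map and a sampled camera motion. It is an illustration under stated assumptions, not the authors' implementation: it assumes a pinhole camera with known intrinsics `K` and a static scene, and the function names `synthesize_flow` and `random_motion`, along with the sampling ranges, are hypothetical choices made for this example.

```python
import numpy as np


def random_motion(rng, max_angle=0.05, max_shift=0.5):
    """Sample a small rigid motion (hypothetical ranges, not from the paper).

    Returns a rotation matrix R (3, 3) and a translation t (3,).
    """
    axis_angle = rng.uniform(-max_angle, max_angle, size=3)
    theta = np.linalg.norm(axis_angle)
    if theta < 1e-12:
        R = np.eye(3)
    else:
        kx, ky, kz = axis_angle / theta
        # Skew-symmetric matrix of the unit rotation axis.
        Kx = np.array([[0.0, -kz, ky], [kz, 0.0, -kx], [-ky, kx, 0.0]])
        # Rodrigues' rotation formula.
        R = np.eye(3) + np.sin(theta) * Kx + (1.0 - np.cos(theta)) * (Kx @ Kx)
    t = rng.uniform(-max_shift, max_shift, size=3)
    return R, t


def synthesize_flow(depth, K, R, t):
    """Flow induced by moving the camera by (R, t) over a static scene.

    depth: (H, W) depth map of the real frame; non-positive values are invalid.
    K:     (3, 3) pinhole intrinsics (an assumption of this sketch).
    R, t:  map points from the real camera frame to the virtual camera frame.
    Returns the flow (H, W, 2) in pixels and a boolean validity mask (H, W).
    """
    h, w = depth.shape
    u, v = np.meshgrid(np.arange(w, dtype=np.float64),
                       np.arange(h, dtype=np.float64))
    pix = np.stack([u, v, np.ones_like(u)], axis=-1)   # homogeneous pixels
    rays = pix @ np.linalg.inv(K).T                    # back-project to rays
    points = rays * depth[..., None]                   # 3D points, real frame
    moved = points @ R.T + t                           # 3D points, virtual frame
    proj = moved @ K.T                                 # project to virtual image
    z = proj[..., 2]
    valid = (depth > 0) & (z > 1e-6)                   # drop holes and points behind camera
    safe_z = np.where(valid, z, 1.0)
    new_uv = proj[..., :2] / safe_z[..., None]
    flow = np.where(valid[..., None], new_uv - pix[..., :2], 0.0)
    return flow, valid
```

Each call produces one training sample: the flow field is the network input and the sampled `(R, t)` is the target pose, so no ground truth trajectory is required. As a sanity check on the geometry, a pure sideways translation over a fronto-parallel plane at depth `Z` should give a uniform horizontal flow of `fx * tx / Z` pixels.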

research
05/12/2022

Dynamic Dense RGB-D SLAM using Learning-based Visual Odometry

We propose a dense dynamic RGB-D SLAM pipeline based on a learning-based...
research
01/07/2020

AD-VO: Scale-Resilient Visual Odometry Using Attentive Disparity Map

Visual odometry is an essential key for a localization module in SLAM sy...
research
04/26/2019

GN-Net: The Gauss-Newton Loss for Deep Direct SLAM

Direct methods for SLAM have shown exceptional performance on odometry t...
research
01/03/2023

BS3D: Building-scale 3D Reconstruction from RGB-D Images

Various datasets have been proposed for simultaneous localization and ma...
research
08/19/2020

MineNav: An Expandable Synthetic Dataset Based on Minecraft for Aircraft Visual Navigation

We propose a simple method to generate a high-quality synthetic dataset ba...
research
03/01/2021

DF-VO: What Should Be Learnt for Visual Odometry?

Multi-view geometry-based methods dominate the last few decades in monoc...
research
05/05/2021

Moving SLAM: Fully Unsupervised Deep Learning in Non-Rigid Scenes

We propose a method to train deep networks to decompose videos into 3D g...
