TartanVO: A Generalizable Learning-based VO

10/31/2020
by   Wenshan Wang, et al.
0

We present the first learning-based visual odometry (VO) model, which generalizes to multiple datasets and real-world scenarios and outperforms geometry-based methods in challenging scenes. We achieve this by leveraging the SLAM dataset TartanAir, which provides a large amount of diverse synthetic data in challenging environments. Furthermore, to make our VO model generalize across datasets, we propose an up-to-scale loss function and incorporate the camera intrinsic parameters into the model. Experiments show that a single model, TartanVO, trained only on synthetic data, without any finetuning, can be generalized to real-world datasets such as KITTI and EuRoC, demonstrating significant advantages over the geometry-based methods on challenging trajectories. Our code is available at https://github.com/castacks/tartanvo.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/16/2021

CrossLoc: Scalable Aerial Localization Assisted by Multimodal Synthetic Data

We present a visual localization system that learns to estimate camera p...
research
03/11/2018

Unsupervised Learning of Monocular Depth Estimation and Visual Odometry with Deep Feature Reconstruction

Despite learning based methods showing promising results in single view ...
research
08/16/2022

TRoVE: Transforming Road Scene Datasets into Photorealistic Virtual Environments

High-quality structured data with rich annotations are critical componen...
research
09/02/2023

ASF-Net: Robust Video Deraining via Temporal Alignment and Online Adaptive Learning

In recent times, learning-based methods for video deraining have demonst...
research
06/02/2023

Bilevel Fast Scene Adaptation for Low-Light Image Enhancement

Enhancing images in low-light scenes is a challenging but widely concern...
research
02/02/2023

Empirical Analysis of the AdaBoost's Error Bound

Understanding the accuracy limits of machine learning algorithms is esse...
research
11/09/2020

Learning to Localize in New Environments from Synthetic Training Data

Most existing approaches for visual localization either need a detailed ...

Please sign up or login with your details

Forgot password? Click here to reset