DeepVO: Towards End-to-End Visual Odometry with Deep Recurrent Convolutional Neural Networks

09/25/2017
by Sen Wang, et al.

This paper studies the monocular visual odometry (VO) problem. Most existing VO algorithms are developed under a standard pipeline that includes feature extraction, feature matching, motion estimation, local optimisation, etc. Although some of them have demonstrated superior performance, they usually need to be carefully designed and specifically fine-tuned to work well in different environments. Some prior knowledge is also required to recover an absolute scale for monocular VO. This paper presents a novel end-to-end framework for monocular VO using deep Recurrent Convolutional Neural Networks (RCNNs). Since it is trained and deployed in an end-to-end manner, it infers poses directly from a sequence of raw RGB images (videos) without adopting any module of the conventional VO pipeline. Based on the RCNNs, it not only automatically learns an effective feature representation for the VO problem through Convolutional Neural Networks, but also implicitly models sequential dynamics and relations using deep Recurrent Neural Networks. Extensive experiments on the KITTI VO dataset show performance competitive with state-of-the-art methods, verifying that the end-to-end Deep Learning technique can be a viable complement to traditional VO systems.
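
The data flow described in the abstract (convolutional features per time step, a recurrent state carried across the sequence, one pose per step) can be illustrated with a minimal, untrained sketch in plain Python. This is not the authors' network: the frame-difference input, the single convolution layer, the vanilla tanh recurrence, the 6-value pose output, and all names (`conv_features`, `init_params`, `estimate_poses`) are assumptions chosen only to keep the example small.

```python
import math
import random


def conv2d_valid(image, kernel):
    """'Valid' 2-D cross-correlation of a list-of-lists image with a kernel."""
    kh, kw = len(kernel), len(kernel[0])
    rows = len(image) - kh + 1
    cols = len(image[0]) - kw + 1
    return [
        [
            sum(image[i + a][j + b] * kernel[a][b]
                for a in range(kh) for b in range(kw))
            for j in range(cols)
        ]
        for i in range(rows)
    ]


def conv_features(prev_frame, frame, kernels):
    """Stand-in for the CNN: convolve the frame difference (an assumption),
    apply ReLU, then global-average-pool each feature map to one number."""
    diff = [[c - p for p, c in zip(row_p, row_c)]
            for row_p, row_c in zip(prev_frame, frame)]
    features = []
    for kernel in kernels:
        activations = [max(0.0, v)
                       for row in conv2d_valid(diff, kernel) for v in row]
        features.append(sum(activations) / len(activations))
    return features


def matvec(matrix, vector):
    """Matrix-vector product on nested lists."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def init_params(n_kernels=4, hidden=8, pose_dim=6, seed=0):
    """Random, untrained weights; all sizes are hypothetical."""
    rng = random.Random(seed)

    def rand(rows, cols):
        return [[rng.uniform(-0.5, 0.5) for _ in range(cols)]
                for _ in range(rows)]

    return {
        "kernels": [rand(3, 3) for _ in range(n_kernels)],
        "w_in": rand(hidden, n_kernels),
        "w_rec": rand(hidden, hidden),
        "w_out": rand(pose_dim, hidden),
    }


def estimate_poses(frames, params):
    """Return one pose vector per consecutive frame pair.

    The hidden state is carried across steps, so each output depends on
    the whole history of frames seen so far, not only the current pair."""
    hidden = [0.0] * len(params["w_rec"])
    poses = []
    for prev_frame, frame in zip(frames, frames[1:]):
        feats = conv_features(prev_frame, frame, params["kernels"])
        pre = [a + b for a, b in zip(matvec(params["w_in"], feats),
                                     matvec(params["w_rec"], hidden))]
        hidden = [math.tanh(v) for v in pre]
        poses.append(matvec(params["w_out"], hidden))
    return poses


# Five random 8x8 grayscale "frames" give four frame pairs.
rng = random.Random(42)
frames = [[[rng.random() for _ in range(8)] for _ in range(8)]
          for _ in range(5)]
poses = estimate_poses(frames, init_params())
print(len(poses), len(poses[0]))  # 4 6
```

With random weights the pose values are meaningless; the sketch only shows how a sequence of images maps to a sequence of poses without any hand-designed feature matching or optimisation stage. In an end-to-end setting, all of the weights in `init_params` would instead be learned jointly from image sequences with ground-truth poses.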


research
11/27/2018

MagicVO: End-to-End Monocular Visual Odometry through Deep Bi-directional Recurrent Convolutional Neural Network

This paper proposes a new framework to solve the problem of monocular vi...
research
11/25/2018

Guided Feature Selection for Deep Visual Odometry

We present a novel end-to-end visual odometry architecture with guided f...
research
07/15/2020

Learning Multiplicative Interactions with Bayesian Neural Networks for Visual-Inertial Odometry

This paper presents an end-to-end multi-modal learning approach for mono...
research
04/17/2019

DistanceNet: Estimating Traveled Distance from Monocular Images using a Recurrent Convolutional Neural Network

Classical monocular vSLAM/VO methods suffer from the scale ambiguity pro...
research
07/24/2017

Feature Extraction via Recurrent Random Deep Ensembles and its Application in Group-level Happiness Estimation

This paper presents a novel ensemble framework to extract highly discrim...
research
09/21/2019

Visual Odometry Revisited: What Should Be Learnt?

In this work we present a monocular visual odometry (VO) algorithm which...
research
03/20/2018

Stacked Neural Networks for end-to-end ciliary motion analysis

Cilia are hairlike structures protruding from nearly every cell in the b...
