VidLoc: A Deep Spatio-Temporal Model for 6-DoF Video-Clip Relocalization

02/21/2017
by   Ronald Clark, et al.
1

Machine learning techniques, namely convolutional neural networks (CNN) and regression forests, have recently shown great promise in performing 6-DoF localization of monocular images. However, in most cases image-sequences, rather only single images, are readily available. To this extent, none of the proposed learning-based approaches exploit the valuable constraint of temporal smoothness, often leading to situations where the per-frame error is larger than the camera motion. In this paper we propose a recurrent model for performing 6-DoF localization of video-clips. We find that, even by considering only short sequences (20 frames), the pose estimates are smoothed and the localization error can be drastically reduced. Finally, we consider means of obtaining probabilistic pose estimates from our model. We evaluate our method on openly-available real-world autonomous driving and indoor localization datasets.

READ FULL TEXT

page 1

page 3

page 7

page 8

research
12/23/2021

Multi-Camera Sensor Fusion for Visual Odometry using Deep Uncertainty Estimation

Visual Odometry (VO) estimation is an important source of information fo...
research
10/02/2019

Object Parsing in Sequences Using CoordConv Gated Recurrent Networks

We present a monocular object parsing framework for consistent keypoint ...
research
04/28/2022

Spatio-Temporal Graph Localization Networks for Image-based Navigation

Localization in topological maps is essential for image-based navigation...
research
06/23/2020

PoseGAN: A Pose-to-Image Translation Framework for Camera Localization

Camera localization is a fundamental requirement in robotics and compute...
research
03/05/2019

Robot Localization in Floor Plans Using a Room Layout Edge Extraction Network

Indoor localization is one of the crucial enablers for deployment of ser...
research
05/13/2020

3D Scene Geometry-Aware Constraint for Camera Localization with Deep Learning

Camera localization is a fundamental and key component of autonomous dri...
research
02/27/2019

Single-frame Regularization for Temporally Stable CNNs

Convolutional neural networks (CNNs) can model complicated non-linear re...

Please sign up or login with your details

Forgot password? Click here to reset