Full-Frame Scene Coordinate Regression for Image-Based Localization

02/09/2018
by Xiaotian Li, et al.

Image-based localization, also called camera relocalization, is a fundamental problem in computer vision and robotics: estimating the camera pose from an image. Recent state-of-the-art approaches use learning-based methods, such as Random Forests (RFs) and Convolutional Neural Networks (CNNs), to regress, for each pixel in the image, its corresponding position in the scene's world coordinate frame, and then solve for the final pose with a RANSAC-based optimization scheme using the predicted correspondences. In this paper, we propose to perform scene coordinate regression in a full-frame manner rather than a patch-based one. This makes computation efficient at test time and, more importantly, adds more global context to the regression process, which improves robustness. To do so, we adopt a fully convolutional encoder-decoder neural network architecture that accepts a whole image as input and produces scene coordinate predictions for all pixels in the image. However, a network that uses more global context is prone to overfitting. To alleviate this issue, we use data augmentation to generate more training data, augmenting not only in 2D image space but also in 3D space. We evaluate our approach on the publicly available 7-Scenes dataset. Experiments show that it produces better scene coordinate predictions and achieves state-of-the-art localization results, with improved robustness on the hardest frames (e.g., frames with repeated structures).
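To make the link between per-pixel scene coordinate predictions and the final pose concrete, the following is a minimal sketch of the hypothesis-scoring step that a RANSAC-style pose solver typically relies on: a candidate pose is scored by how many predicted 2D-3D correspondences it reprojects to within a pixel threshold. This is not the paper's implementation. The function names (`project`, `count_inliers`), the world-to-camera pose convention, the intrinsics in `K`, the 10-pixel threshold, and the synthetic data are all assumptions chosen for illustration, and the minimal pose solver that would generate the hypotheses is omitted.

```python
import numpy as np

def project(scene_xyz, R, t, K):
    """Project world-frame points (N, 3) into the image.

    Assumes (R, t) maps world coordinates to camera coordinates.
    Returns pixel coordinates (N, 2) and camera-frame depths (N,).
    """
    cam = scene_xyz @ R.T + t      # points in the camera frame
    uvw = cam @ K.T                # homogeneous pixel coordinates
    return uvw[:, :2] / uvw[:, 2:3], cam[:, 2]

def count_inliers(pixels, scene_xyz, R, t, K, thresh_px=10.0):
    """Count correspondences with reprojection error below thresh_px."""
    proj, depth = project(scene_xyz, R, t, K)
    err = np.linalg.norm(proj - pixels, axis=1)
    return int(np.sum((err < thresh_px) & (depth > 0)))

# Illustrative pinhole intrinsics (assumed values, not taken from the paper).
K = np.array([[525.0, 0.0, 320.0],
              [0.0, 525.0, 240.0],
              [0.0, 0.0, 1.0]])

# Synthetic ground truth: 200 scene points 2 to 4 m in front of the camera.
rng = np.random.default_rng(0)
scene = rng.uniform([-1.0, -1.0, 2.0], [1.0, 1.0, 4.0], size=(200, 3))
R_true, t_true = np.eye(3), np.zeros(3)
pixels, _ = project(scene, R_true, t_true, K)

# Simulated network output: the first 60 predictions are off by 0.5 m in x and y.
predicted = scene.copy()
predicted[:60] += np.array([0.5, 0.5, 0.0])

# Score the true pose against a hypothesis shifted by 0.3 m along x.
inliers_true = count_inliers(pixels, predicted, R_true, t_true, K)
inliers_wrong = count_inliers(pixels, predicted, R_true,
                              t_true + np.array([0.3, 0.0, 0.0]), K)
print(inliers_true, inliers_wrong)
```

In this toy setup the correct pose keeps every uncorrupted correspondence as an inlier while the shifted hypothesis keeps far fewer, which is the property a RANSAC loop exploits when it selects the best-scoring hypothesis despite a fraction of wrong scene coordinate predictions.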


