Geometric Loss Functions for Camera Pose Regression with Deep Learning

04/02/2017
by   Alex Kendall, et al.
0

Deep learning has shown to be effective for robust and real-time monocular image relocalisation. In particular, PoseNet is a deep convolutional neural network which learns to regress the 6-DOF camera pose from a single image. It learns to localize using high level features and is robust to difficult lighting, motion blur and unknown camera intrinsics, where point based SIFT registration fails. However, it was trained using a naive loss function, with hyper-parameters which require expensive tuning. In this paper, we give the problem a more fundamental theoretical treatment. We explore a number of novel loss functions for learning camera pose which are based on geometry and scene reprojection error. Additionally we show how to automatically learn an optimal weighting to simultaneously regress position and orientation. By leveraging geometry, we demonstrate that our technique significantly improves PoseNet's performance across datasets ranging from indoor rooms to a small city.

READ FULL TEXT

page 1

page 6

research
05/27/2015

PoseNet: A Convolutional Network for Real-Time 6-DOF Camera Relocalization

We present a robust and real-time monocular six degree of freedom reloca...
research
02/24/2018

Euler angles based loss function for camera relocalization with Deep learning

Deep learning has been applied to camera relocalization, in particular, ...
research
05/04/2022

Homography-Based Loss Function for Camera Pose Regression

Some recent visual-based relocalization algorithms rely on deep learning...
research
06/26/2019

On the Role of Geometry in Geo-Localization

Humans can build a mental map of a geographical area to find their way a...
research
02/10/2023

CGA-PoseNet: Camera Pose Regression via a 1D-Up Approach to Conformal Geometric Algebra

We introduce CGA-PoseNet, which uses the 1D-Up approach to Conformal Geo...
research
08/23/2017

3D Morphable Models as Spatial Transformer Networks

In this paper, we show how a 3D Morphable Model (i.e. a statistical mode...
research
05/13/2020

3D Scene Geometry-Aware Constraint for Camera Localization with Deep Learning

Camera localization is a fundamental and key component of autonomous dri...

Please sign up or login with your details

Forgot password? Click here to reset