Computing CNN Loss and Gradients for Pose Estimation with Riemannian Geometry

by   Benjamin Hou, et al.

Pose estimation, i.e. predicting a 3D rigid transformation with respect to a fixed co-ordinate frame in, SE(3), is an omnipresent problem in medical image analysis with applications such as: image rigid registration, anatomical standard plane detection, tracking and device/camera pose estimation. Deep learning methods often parameterise a pose with a representation that separates rotation and translation. As commonly available frameworks do not provide means to calculate loss on a manifold, regression is usually performed using the L2-norm independently on the rotation's and the translation's parameterisations, which is a metric for linear spaces that does not take into account the Lie group structure of SE(3). In this paper, we propose a general Riemannian formulation of the pose estimation problem. We propose to train the CNN directly on SE(3) equipped with a left-invariant Riemannian metric, coupling the prediction of the translation and rotation defining the pose. At each training step, the ground truth and predicted pose are elements of the manifold, where the loss is calculated as the Riemannian geodesic distance. We then compute the optimisation direction by back-propagating the gradient with respect to the predicted pose on the tangent space of the manifold SE(3) and update the network weights. We thoroughly evaluate the effectiveness of our loss function by comparing its performance with popular and most commonly used existing methods, on tasks such as image-based localisation and intensity-based 2D/3D registration. We also show that hyper-parameters, used in our loss function to weight the contribution between rotations and translations, can be intrinsically calculated from the dataset to achieve greater performance margins.


page 1

page 2

page 3

page 4


Vectorial Parameterizations of Pose

Robotics and computer vision problems commonly require handling rigid-bo...

Convex Relaxations of SE(2) and SE(3) for Visual Pose Estimation

This paper proposes a new method for rigid body pose estimation based on...

Relative Pose Estimation of Calibrated Cameras with Known SE(3) Invariants

The SE(3) invariants of a pose include its rotation angle and screw tran...

Improved Pose Graph Optimization for Planar Motions Using Riemannian Geometry on the Manifold of Dual Quaternions

We present a novel Riemannian approach for planar pose graph optimizatio...

Probabilistic Rotation Representation With an Efficiently Computable Bingham Loss Function and Its Application to Pose Estimation

In recent years, a deep learning framework has been widely used for obje...

Improving Image-Based Localization with Deep Learning: The Impact of the Loss Function

This work formulates a novel loss term which can be appended to an RGB o...

Moving Frame Net: SE(3)-Equivariant Network for Volumes

Equivariance of neural networks to transformations helps to improve thei...

Please sign up or login with your details

Forgot password? Click here to reset