Learning to Localize in Unseen Scenes with Relative Pose Regressors

03/05/2023
by   Ofer Idan, et al.
0

Relative pose regressors (RPRs) localize a camera by estimating its relative translation and rotation to a pose-labelled reference. Unlike scene coordinate regression and absolute pose regression methods, which learn absolute scene parameters, RPRs can (theoretically) localize in unseen environments, since they only learn the residual pose between camera pairs. In practice, however, the performance of RPRs is significantly degraded in unseen scenes. In this work, we propose to aggregate paired feature maps into latent codes, instead of operating on global image descriptors, in order to improve the generalization of RPRs. We implement aggregation with concatenation, projection, and attention operations (Transformer Encoders) and learn to regress the relative pose parameters from the resulting latent codes. We further make use of a recently proposed continuous representation of rotation matrices, which alleviates the limitations of the commonly used quaternions. Compared to state-of-the-art RPRs, our model is shown to localize significantly better in unseen environments, across both indoor and outdoor benchmarks, while maintaining competitive performance in seen scenes. We validate our findings and architecture design through multiple ablations. Our code and pretrained models is publicly available.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/21/2021

Learning Multi-Scene Absolute Pose Regression with Transformers

Absolute camera pose regressors estimate the position and orientation of...
research
08/22/2023

Coarse-to-Fine Multi-Scene Pose Regression with Transformers

Absolute camera pose regressors estimate the position and orientation of...
research
04/03/2023

RePAST: Relative Pose Attention Scene Representation Transformer

The Scene Representation Transformer (SRT) is a recent method to render ...
research
03/21/2021

Paying Attention to Activation Maps in Camera Pose Regression

Camera pose regression methods apply a single forward pass to the query ...
research
12/22/2020

Do We Really Need Scene-specific Pose Encoders?

Visual pose regression models estimate the camera pose from a query imag...
research
04/08/2021

Direct-PoseNet: Absolute Pose Regression with Photometric Consistency

We present a relocalization pipeline, which combines an absolute pose re...
research
08/18/2022

The 8-Point Algorithm as an Inductive Bias for Relative Pose Prediction by ViTs

We present a simple baseline for directly estimating the relative pose (...

Please sign up or login with your details

Forgot password? Click here to reset