Camera Pose Auto-Encoders for Improving Pose Regression

07/12/2022
by   Yoli Shavit, et al.
0

Absolute pose regressor (APR) networks are trained to estimate the pose of the camera given a captured image. They compute latent image representations from which the camera position and orientation are regressed. APRs provide a different tradeoff between localization accuracy, runtime, and memory, compared to structure-based localization schemes that provide state-of-the-art accuracy. In this work, we introduce Camera Pose Auto-Encoders (PAEs), multilayer perceptrons that are trained via a Teacher-Student approach to encode camera poses using APRs as their teachers. We show that the resulting latent pose representations can closely reproduce APR performance and demonstrate their effectiveness for related tasks. Specifically, we propose a light-weight test-time optimization in which the closest train poses are encoded and used to refine camera position estimation. This procedure achieves a new state-of-the-art position accuracy for APRs, on both the CambridgeLandmarks and 7Scenes benchmarks. We also show that train images can be reconstructed from the learned pose encoding, paving the way for integrating visual information from the train set at a low memory cost. Our code and pre-trained models are available at https://github.com/yolish/camera-pose-auto-encoders.

READ FULL TEXT
research
05/05/2022

ImPosIng: Implicit Pose Encoding for Efficient Camera Pose Estimation

We propose a novel learning-based formulation for camera pose estimation...
research
12/22/2020

Do We Really Need Scene-specific Pose Encoders?

Visual pose regression models estimate the camera pose from a query imag...
research
03/05/2023

HyperPose: Camera Pose Localization using Attention Hypernetworks

In this study, we propose the use of attention hypernetworks in camera p...
research
03/21/2021

Paying Attention to Activation Maps in Camera Pose Regression

Camera pose regression methods apply a single forward pass to the query ...
research
03/31/2020

Real-Time Camera Pose Estimation for Sports Fields

Given an image sequence featuring a portion of a sports field filmed by ...
research
05/27/2017

PVEs: Position-Velocity Encoders for Unsupervised Learning of Structured State Representations

We propose position-velocity encoders (PVEs) which learn---without super...
research
01/05/2023

A Probabilistic Framework for Visual Localization in Ambiguous Scenes

Visual localization allows autonomous robots to relocalize when losing t...

Please sign up or login with your details

Forgot password? Click here to reset