PoseGAN: A Pose-to-Image Translation Framework for Camera Localization

06/23/2020
by   Kanglin Liu, et al.
37

Camera localization is a fundamental requirement in robotics and computer vision. This paper introduces a pose-to-image translation framework to tackle the camera localization problem. We present PoseGANs, a conditional generative adversarial networks (cGANs) based framework for the implementation of pose-to-image translation. PoseGANs feature a number of innovations including a distance metric based conditional discriminator to conduct camera localization and a pose estimation technique for generated camera images as a stronger constraint to improve camera localization performance. Compared with learning-based regression methods such as PoseNet, PoseGANs can achieve better performance with model sizes that are 70 introduce the view synthesis technique to establish the correspondence between the 2D images and the scene, i.e., given a pose, PoseGANs are able to synthesize its corresponding camera images. Furthermore, we demonstrate that PoseGANs differ in principle from structure-based localization and learning-based regressions for camera localization, and show that PoseGANs exploit the geometric structures to accomplish the camera localization task, and is therefore more stable than and superior to learning-based regressions which rely on local texture features instead. In addition to camera localization and view synthesis, we also demonstrate that PoseGANs can be successfully used for other interesting applications such as moving object elimination and frame interpolation in video sequences.

READ FULL TEXT

page 9

page 10

page 11

page 12

page 13

research
03/15/2019

Adversarial Joint Image and Pose Distribution Learning for Camera Pose Regression and Refinement

In this paper we present a deep-learning based framework for direct came...
research
05/05/2022

ImPosIng: Implicit Pose Encoding for Efficient Camera Pose Estimation

We propose a novel learning-based formulation for camera pose estimation...
research
10/22/2020

Novel View Synthesis from only a 6-DoF Camera Pose by Two-stage Networks

Novel view synthesis is a challenging problem in computer vision and rob...
research
07/07/2021

Video-Based Camera Localization Using Anchor View Detection and Recursive 3D Reconstruction

In this paper we introduce a new camera localization strategy designed f...
research
02/21/2017

VidLoc: A Deep Spatio-Temporal Model for 6-DoF Video-Clip Relocalization

Machine learning techniques, namely convolutional neural networks (CNN) ...
research
12/25/2021

DeepMTL Pro: Deep Learning Based MultipleTransmitter Localization and Power Estimation

In this paper, we address the problem of Multiple Transmitter Localizati...
research
02/04/2020

Deep-Geometric 6 DoF Localization from a Single Image in Topo-metric Maps

We describe a Deep-Geometric Localizer that is able to estimate the full...

Please sign up or login with your details

Forgot password? Click here to reset