Adversarial Joint Image and Pose Distribution Learning for Camera Pose Regression and Refinement

03/15/2019
by Mai Bui, et al.

In this paper we present a deep-learning-based framework for direct camera pose regression and refinement that uses RGB information only. The framework regresses the camera pose and also offers a solely RGB-based solution for camera pose refinement. Building on recent camera pose regression methods, we investigate the effect of adversarial networks on convolutional neural networks (CNNs) trained for camera re-localization, with the goal of better learning the geometric connection between a camera pose and its corresponding RGB image. Similar to Generative Adversarial Networks (GANs), in addition to a camera pose regressor that maps images to poses, we propose to train a discriminator that distinguishes between regressed and ground truth poses. This pose discriminator is conditioned on features extracted from the respective input image, so that it implicitly models the relationship between image and pose. Once learned, it can be used to update the predicted camera poses and improve the localization accuracy.
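The refinement idea in the abstract (using a learned, image-conditioned discriminator to update a regressed pose) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the discriminator is a hand-written toy function rather than a trained network, the pose is reduced to a 3D translation (rotation is omitted), and the names `discriminator_score`, `score_gradient`, `refine_pose`, and `weights` are hypothetical. The update rule shown is plain gradient ascent on the discriminator score, which is one plausible way such a refinement could work.

```python
import math

def expected_pose(features, weights):
    # Hypothetical stand-in for what a trained discriminator has learned:
    # a linear map from image features to the pose it considers consistent.
    return [sum(w * f for w, f in zip(row, features)) for row in weights]

def discriminator_score(features, pose, weights):
    # Toy discriminator conditioned on image features.
    # Returns a value in (0, 1]; higher means the pose looks more
    # consistent with the image features.
    target = expected_pose(features, weights)
    sq_dist = sum((p - t) ** 2 for p, t in zip(pose, target))
    return math.exp(-sq_dist)

def score_gradient(features, pose, weights):
    # Analytic gradient of exp(-||p - t||^2) with respect to the pose p.
    target = expected_pose(features, weights)
    score = discriminator_score(features, pose, weights)
    return [-2.0 * (p - t) * score for p, t in zip(pose, target)]

def refine_pose(features, pose, weights, step=0.1, iters=50):
    # Gradient ascent on the discriminator score: move the regressed
    # pose toward poses the discriminator rates as more plausible.
    pose = list(pose)
    for _ in range(iters):
        grad = score_gradient(features, pose, weights)
        pose = [p + step * g for p, g in zip(pose, grad)]
    return pose

# Illustrative values only (not from the paper).
features = [1.0, 0.5]                              # image feature vector
weights = [[1.0, 0.0], [0.0, 2.0], [0.5, 0.5]]     # toy discriminator parameters
regressed = [1.5, 1.5, 1.25]                       # initial regressed translation
refined = refine_pose(features, regressed, weights)
```

In an actual system the toy score would be replaced by a trained network and the gradient obtained by automatic differentiation; a full pose would also need a rotation parameterization (for example a unit quaternion that is renormalized after each update).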


research
12/04/2019

GraphPoseGAN: 3D Hand Pose Estimation from a Monocular RGB Image via Adversarial Learning on Graphs

This paper addresses the problem of 3D hand pose estimation from a monoc...
research
06/23/2020

PoseGAN: A Pose-to-Image Translation Framework for Camera Localization

Camera localization is a fundamental requirement in robotics and compute...
research
04/27/2023

ContraNeRF: 3D-Aware Generative Model via Contrastive Learning with Unsupervised Implicit Pose Embedding

Although 3D-aware GANs based on neural radiance fields have achieved com...
research
02/10/2023

CGA-PoseNet: Camera Pose Regression via a 1D-Up Approach to Conformal Geometric Algebra

We introduce CGA-PoseNet, which uses the 1D-Up approach to Conformal Geo...
research
10/03/2017

Simulating Structure-from-Motion

The implementation of a Structure-from-Motion (SfM) pipeline from a synt...
research
09/01/2021

On the Limits of Pseudo Ground Truth in Visual Camera Re-localisation

Benchmark datasets that measure camera pose accuracy have driven progres...
research
02/03/2023

Robust Camera Pose Refinement for Multi-Resolution Hash Encoding

Multi-resolution hash encoding has recently been proposed to reduce the ...
