RePAST: Relative Pose Attention Scene Representation Transformer

04/03/2023
by   Aleksandr Safin, et al.
0

The Scene Representation Transformer (SRT) is a recent method to render novel views at interactive rates. Since SRT uses camera poses with respect to an arbitrarily chosen reference camera, it is not invariant to the order of the input views. As a result, SRT is not directly applicable to large-scale scenes where the reference frame would need to be changed regularly. In this work, we propose Relative Pose Attention SRT (RePAST): Instead of fixing a reference frame at the input, we inject pairwise relative camera pose information directly into the attention mechanism of the Transformers. This leads to a model that is by definition invariant to the choice of any global reference frame, while still retaining the full capabilities of the original method. Empirical results show that adding this invariance to the model does not lead to a loss in quality. We believe that this is a step towards applying fully latent transformer-based rendering methods to large-scale scenes.

READ FULL TEXT
research
07/27/2022

Is Attention All NeRF Needs?

We present Generalizable NeRF Transformer (GNT), a pure, unified transfo...
research
03/05/2023

Learning to Localize in Unseen Scenes with Relative Pose Regressors

Relative pose regressors (RPRs) localize a camera by estimating its rela...
research
08/22/2023

Coarse-to-Fine Multi-Scene Pose Regression with Transformers

Absolute camera pose regressors estimate the position and orientation of...
research
03/21/2021

Learning Multi-Scene Absolute Pose Regression with Transformers

Absolute camera pose regressors estimate the position and orientation of...
research
05/28/2021

TransCamP: Graph Transformer for 6-DoF Camera Pose Estimation

Camera pose estimation or camera relocalization is the centerpiece in nu...
research
08/30/2023

Drone-NeRF: Efficient NeRF Based 3D Scene Reconstruction for Large-Scale Drone Survey

Neural rendering has garnered substantial attention owing to its capacit...
research
06/30/2023

Act3D: Infinite Resolution Action Detection Transformer for Robotic Manipulation

3D perceptual representations are well suited for robot manipulation as ...

Please sign up or login with your details

Forgot password? Click here to reset