STR-GQN: Scene Representation and Rendering for Unknown Cameras Based on Spatial Transformation Routing

08/06/2021
by   Wen-Cheng Chen, et al.
8

Geometry-aware modules are widely applied in recent deep learning architectures for scene representation and rendering. However, these modules require intrinsic camera information that might not be obtained accurately. In this paper, we propose a Spatial Transformation Routing (STR) mechanism to model the spatial properties without applying any geometric prior. The STR mechanism treats the spatial transformation as the message passing process, and the relation between the view poses and the routing weights is modeled by an end-to-end trainable neural network. Besides, an Occupancy Concept Mapping (OCM) framework is proposed to provide explainable rationals for scene-fusion processes. We conducted experiments on several datasets and show that the proposed STR mechanism improves the performance of the Generative Query Network (GQN). The visualization results reveal that the routing process can pass the observed information from one location of some view to the associated location in the other view, which demonstrates the advantage of the proposed model in terms of spatial cognition.

READ FULL TEXT

page 7

page 13

page 14

page 15

page 16

page 17

page 18

page 19

research
12/01/2022

Unbiased Heterogeneous Scene Graph Generation with Relation-aware Message Passing Neural Network

Recent scene graph generation (SGG) frameworks have focused on learning ...
research
07/25/2019

SceneGraphNet: Neural Message Passing for 3D Indoor Scene Augmentation

In this paper we propose a neural message passing approach to augment an...
research
05/05/2023

General Neural Gauge Fields

The recent advance of neural fields, such as neural radiance fields, has...
research
12/06/2018

Context-Aware Synthesis and Placement ofObject Instances

Learning to insert an object instance into an image in a semantically co...
research
12/06/2018

Context-Aware Synthesis and Placement of Object Instances

Learning to insert an object instance into an image in a semantically co...
research
08/22/2023

Enhancing NeRF akin to Enhancing LLMs: Generalizable NeRF Transformer with Mixture-of-View-Experts

Cross-scene generalizable NeRF models, which can directly synthesize nov...
research
03/07/2019

Stratified Labeling for Surface Consistent Parallax Correction and Occlusion Completion

The light field faithfully records the spatial and angular configuration...

Please sign up or login with your details

Forgot password? Click here to reset