LASER: LAtent SpacE Rendering for 2D Visual Localization

04/01/2022
by   Zhixiang Min, et al.
3

We present LASER, an image-based Monte Carlo Localization (MCL) framework for 2D floor maps. LASER introduces the concept of latent space rendering, where 2D pose hypotheses on the floor map are directly rendered into a geometrically-structured latent space by aggregating viewing ray features. Through a tightly coupled rendering codebook scheme, the viewing ray features are dynamically determined at rendering-time based on their geometries (i.e. length, incident-angle), endowing our representation with view-dependent fine-grain variability. Our codebook scheme effectively disentangles feature encoding from rendering, allowing the latent space rendering to run at speeds above 10KHz. Moreover, through metric learning, our geometrically-structured latent space is common to both pose hypotheses and query images with arbitrary field of views. As a result, LASER achieves state-of-the-art performance on large-scale indoor localization datasets (i.e. ZInD and Structured3D) for both panorama and perspective image queries, while significantly outperforming existing learning-based methods in speed.

READ FULL TEXT

page 3

page 6

page 7

page 8

research
06/25/2021

Half-body Portrait Relighting with Overcomplete Lighting Representation

We present a neural-based model for relighting a half-body portrait imag...
research
12/02/2021

Learning Neural Light Fields with Ray-Space Embedding Networks

Neural radiance fields (NeRFs) produce state-of-the-art view synthesis r...
research
01/13/2023

Laser: Latent Set Representations for 3D Generative Modeling

NeRF provides unparalleled fidelity of novel view synthesis: rendering a...
research
10/03/2018

Deep processing of structured data

We construct a general unified framework for learning representation of ...
research
06/21/2022

RendNet: Unified 2D/3D Recognizer With Latent Space Rendering

Vector graphics (VG) have been ubiquitous in our daily life with vast ap...
research
12/12/2022

ROAD: Learning an Implicit Recursive Octree Auto-Decoder to Efficiently Encode 3D Shapes

Compact and accurate representations of 3D shapes are central to many pe...
research
08/17/2023

Language-enhanced RNR-Map: Querying Renderable Neural Radiance Field maps with natural language

We present Le-RNR-Map, a Language-enhanced Renderable Neural Radiance ma...

Please sign up or login with your details

Forgot password? Click here to reset