HSCNet++: Hierarchical Scene Coordinate Classification and Regression for Visual Localization with Transformer

05/05/2023
by   Shuzhe Wang, et al.
0

Visual localization is critical to many applications in computer vision and robotics. To address single-image RGB localization, state-of-the-art feature-based methods match local descriptors between a query image and a pre-built 3D model. Recently, deep neural networks have been exploited to regress the mapping between raw pixels and 3D coordinates in the scene, and thus the matching is implicitly performed by the forward pass through the network. However, in a large and ambiguous environment, learning such a regression task directly can be difficult for a single network. In this work, we present a new hierarchical scene coordinate network to predict pixel scene coordinates in a coarse-to-fine manner from a single RGB image. The proposed method, which is an extension of HSCNet, allows us to train compact models which scale robustly to large environments. It sets a new state-of-the-art for single-image localization on the 7-Scenes, 12 Scenes, Cambridge Landmarks datasets, and the combined indoor scenes.

READ FULL TEXT

page 8

page 12

page 13

research
09/13/2019

Hierarchical Joint Scene Coordinate Classification and Regression for Visual Localization

Visual localization is pivotal to many applications in computer vision a...
research
07/21/2023

SACReg: Scene-Agnostic Coordinate Regression for Visual Localization

Scene coordinates regression (SCR), i.e., predicting 3D coordinates for ...
research
07/28/2023

D2S: Representing local descriptors and global scene coordinates for camera relocalization

State-of-the-art visual localization methods mostly rely on complex proc...
research
03/07/2023

Where We Are and What We're Looking At: Query Based Worldwide Image Geo-localization Using Hierarchies and Scenes

Determining the exact latitude and longitude that a photo was taken is a...
research
04/12/2023

SGL: Structure Guidance Learning for Camera Localization

Camera localization is a classical computer vision task that serves vari...
research
05/23/2021

VS-Net: Voting with Segmentation for Visual Localization

Visual localization is of great importance in robotics and computer visi...
research
04/29/2022

Where in the World is this Image? Transformer-based Geo-localization in the Wild

Predicting the geographic location (geo-localization) from a single grou...

Please sign up or login with your details

Forgot password? Click here to reset