Hierarchical Joint Scene Coordinate Classification and Regression for Visual Localization

09/13/2019
by   Xiaotian Li, et al.
8

Visual localization is pivotal to many applications in computer vision and robotics. To address single-image RGB localization, state-of-the-art feature based methods solve the task by matching local descriptors between a query image and a pre-built 3D model. Recently, deep neural networks have been exploited to directly learn the mapping between raw pixels and 3D coordinates in the scene, and thus the matching is implicitly performed by the forward pass through the network. In this work, we present a new hierarchical joint classification-regression network to predict pixel scene coordinates in a coarse-to-fine manner from a single RGB image. The network consists of a series of output layers with each of them conditioned on the outputs of previous ones, where the final output layer regresses the coordinates and the others produce coarse location labels. Our experiments show that the proposed method outperforms the vanilla scene coordinate regression network and is more scalable to large environments. With data augmentation, it achieves the state-of-the-art single-image RGB localization performance on three benchmark datasets.

READ FULL TEXT

page 2

page 4

page 7

research
05/05/2023

HSCNet++: Hierarchical Scene Coordinate Classification and Regression for Visual Localization with Transformer

Visual localization is critical to many applications in computer vision ...
research
03/31/2021

Learning Camera Localization via Dense Scene Matching

Camera localization aims to estimate 6 DoF camera poses from RGB images....
research
07/21/2023

SACReg: Scene-Agnostic Coordinate Regression for Visual Localization

Scene coordinates regression (SCR), i.e., predicting 3D coordinates for ...
research
07/28/2023

D2S: Representing local descriptors and global scene coordinates for camera relocalization

State-of-the-art visual localization methods mostly rely on complex proc...
research
08/15/2018

Scene Coordinate Regression with Angle-Based Reprojection Loss for Camera Relocalization

Image-based camera relocalization is an important problem in computer vi...
research
05/29/2018

CocoNet: A deep neural network for mapping pixel coordinates to color values

In this paper, we propose a deep neural network approach for mapping the...
research
04/29/2022

Where in the World is this Image? Transformer-based Geo-localization in the Wild

Predicting the geographic location (geo-localization) from a single grou...

Please sign up or login with your details

Forgot password? Click here to reset