Deep auxiliary learning for visual localization using colorization task

07/01/2021
by   Mi Tian, et al.
0

Visual localization is one of the most important components for robotics and autonomous driving. Recently, inspiring results have been shown with CNN-based methods which provide a direct formulation to end-to-end regress 6-DoF absolute pose. Additional information like geometric or semantic constraints is generally introduced to improve performance. Especially, the latter can aggregate high-level semantic information into localization task, but it usually requires enormous manual annotations. To this end, we propose a novel auxiliary learning strategy for camera localization by introducing scene-specific high-level semantics from self-supervised representation learning task. Viewed as a powerful proxy task, image colorization task is chosen as complementary task that outputs pixel-wise color version of grayscale photograph without extra annotations. In our work, feature representations from colorization network are embedded into localization network by design to produce discriminative features for pose regression. Meanwhile an attention mechanism is introduced for the benefit of localization performance. Extensive experiments show that our model significantly improve localization accuracy over state-of-the-arts on both indoor and outdoor datasets.

READ FULL TEXT

page 2

page 4

page 6

page 7

research
04/23/2018

VLocNet++: Deep Multitask Learning for Semantic Visual Localization and Odometry

Visual localization is one of the fundamental enablers of robot autonomy...
research
03/18/2019

Understanding the Limitations of CNN-based Absolute Camera Pose Regression

Visual localization is the task of accurate camera pose estimation in a ...
research
05/13/2020

3D Scene Geometry-Aware Constraint for Camera Localization with Deep Learning

Camera localization is a fundamental and key component of autonomous dri...
research
12/15/2017

Semantic Visual Localization

Robust visual localization under a wide range of viewing conditions is a...
research
10/24/2020

Improving the generalization of network based relative pose regression: dimension reduction as a regularizer

Visual localization occupies an important position in many areas such as...
research
09/26/2018

Vision-based Semantic Mapping and Localization for Autonomous Indoor Parking

In this paper, we proposed a novel and practical solution for the real-t...
research
11/23/2016

Image-based localization using LSTMs for structured feature correlation

In this work we propose a new CNN+LSTM architecture for camera pose regr...

Please sign up or login with your details

Forgot password? Click here to reset