Improving Worst Case Visual Localization Coverage via Place-specific Sub-selection in Multi-camera Systems

by   Stephen Hausler, et al.

6-DoF visual localization systems utilize principled approaches rooted in 3D geometry to perform accurate camera pose estimation of images to a map. Current techniques use hierarchical pipelines and learned 2D feature extractors to improve scalability and increase performance. However, despite gains in typical recall@0.25m type metrics, these systems still have limited utility for real-world applications like autonomous vehicles because of their `worst' areas of performance - the locations where they provide insufficient recall at a certain required error tolerance. Here we investigate the utility of using `place specific configurations', where a map is segmented into a number of places, each with its own configuration for modulating the pose estimation step, in this case selecting a camera within a multi-camera system. On the Ford AV benchmark dataset, we demonstrate substantially improved worst-case localization performance compared to using off-the-shelf pipelines - minimizing the percentage of the dataset which has low recall at a certain error tolerance, as well as improved overall localization performance. Our proposed approach is particularly applicable to the crowdsharing model of autonomous vehicle deployment, where a fleet of AVs are regularly traversing a known route.


page 1

page 6

page 7


Can You Trust Your Pose? Confidence Estimation in Visual Localization

Camera pose estimation in large-scale environments is still an open ques...

Robustness of control point configurations for homography and planar pose estimation

In this paper, we investigate the influence of the spatial configuration...

Locking On: Leveraging Dynamic Vehicle-Imposed Motion Constraints to Improve Visual Localization

Most 6-DoF localization and SLAM systems use static landmarks but ignore...

SHAPE: Linear-Time Camera Pose Estimation With Quadratic Error-Decay

We propose a novel camera pose estimation or perspective-n-point (PnP) a...

Automatic Coverage Selection for Surface-Based Visual Localization

Localization is a critical capability for robots, drones and autonomous ...

SASSE: Scalable and Adaptable 6-DOF Pose Estimation

Visual localization has become a key enabling component of many place re...

Unsupervised Quality Prediction for Improved Single-Frame and Weighted Sequential Visual Place Recognition

While substantial progress has been made in the absolute performance of ...

Please sign up or login with your details

Forgot password? Click here to reset