MixVPR: Feature Mixing for Visual Place Recognition

03/03/2023
by Amar Ali-bey, et al.

Visual Place Recognition (VPR) is a crucial part of mobile robotics and autonomous driving as well as other computer vision tasks. It refers to the process of identifying a place depicted in a query image using only computer vision. At large scale, repetitive structures, weather and illumination changes pose a real challenge, as appearances can drastically change over time. Along with tackling these challenges, an efficient VPR technique must also be practical in real-world scenarios where latency matters. To address this, we introduce MixVPR, a new holistic feature aggregation technique that takes feature maps from pre-trained backbones as a set of global features. Then, it incorporates a global relationship between elements in each feature map in a cascade of feature mixing, eliminating the need for local or pyramidal aggregation as done in NetVLAD or TransVPR. We demonstrate the effectiveness of our technique through extensive experiments on multiple large-scale benchmarks. Our method outperforms all existing techniques by a large margin while having less than half the number of parameters compared to CosPlace and NetVLAD. We achieve a new all-time high recall@1 score of 94.6% on Pitts250k-test, 88.0% on MapillarySLS, and more importantly, 58.4% on Nordland. Finally, our method outperforms two-stage retrieval techniques such as Patch-NetVLAD, TransVPR and SuperGLUE, all while being orders of magnitude faster. Our code and trained models are available at https://github.com/amaralibey/MixVPR.
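
For readers unfamiliar with the recall@1 metric quoted above, the following is a minimal sketch of how recall@K is commonly computed for single-stage retrieval with global descriptors. It is not the authors' evaluation code: the function names (`cosine_similarity`, `recall_at_k`), the use of cosine similarity, and the toy descriptors are all assumptions made for illustration, and descriptors are assumed to be non-zero vectors.

```python
from math import sqrt


def cosine_similarity(a, b):
    """Cosine similarity between two non-zero descriptor vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = sqrt(sum(x * x for x in a))
    norm_b = sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def recall_at_k(query_descs, db_descs, positives, k=1):
    """Fraction of queries with at least one true match in the top-k results.

    query_descs: list of query descriptors (lists of floats).
    db_descs:    list of database (reference) descriptors.
    positives:   positives[i] is the set of database indices that depict
                 the same place as query i (hypothetical ground truth).
    """
    hits = 0
    for q_idx, query in enumerate(query_descs):
        # Rank all database entries by similarity to the query, best first.
        ranked = sorted(
            range(len(db_descs)),
            key=lambda i: cosine_similarity(query, db_descs[i]),
            reverse=True,
        )
        # The query counts as a hit if any of its top-k results is a positive.
        if any(i in positives[q_idx] for i in ranked[:k]):
            hits += 1
    return hits / len(query_descs)


# Toy data (illustrative only): three database places, three queries.
db = [[1.0, 0.0], [0.0, 1.0], [0.7, 0.7]]
queries = [[0.9, 0.1], [0.1, 0.9], [0.6, 0.8]]
positives = [{0}, {1}, {0}]

r1 = recall_at_k(queries, db, positives, k=1)
r3 = recall_at_k(queries, db, positives, k=3)
```

In this toy setup the third query is closest to the wrong database entry, so it only counts as a hit once K is large enough to include its true match. A single-stage method of this kind ranks by one global descriptor per image, with no re-ranking step, which is what distinguishes it from the two-stage techniques the abstract compares against.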

research
10/19/2022

GSV-Cities: Toward Appropriate Supervised Visual Place Recognition

This paper aims to investigate representation learning for large scale v...
research
08/01/2019

Scalable Place Recognition Under Appearance Change for Autonomous Driving

A major challenge in place recognition for autonomous driving is to be r...
research
07/26/2019

Place Clustering-based Feature Recombination for Visual Place Recognition

Visual place recognition is an important problem in both computer vision...
research
02/23/2020

NeurIPS 2019 Disentanglement Challenge: Improved Disentanglement through Aggregated Convolutional Feature Maps

This report to our stage 1 submission to the NeurIPS 2019 disentanglemen...
research
03/24/2023

PanoVPR: Towards Unified Perspective-to-Equirectangular Visual Place Recognition via Sliding Windows across the Panoramic View

Visual place recognition has received increasing attention in recent yea...
research
01/06/2022

TransVPR: Transformer-based place recognition with multi-level attention aggregation

Visual place recognition is a challenging task for applications such as ...
research
02/27/2020

NeurIPS 2019 Disentanglement Challenge: Improved Disentanglement through Learned Aggregation of Convolutional Feature Maps

This report to our stage 2 submission to the NeurIPS 2019 disentanglemen...
