Deep Visual Geo-localization Benchmark

04/07/2022
by Gabriele Berton, et al.
In this paper, we propose a new open-source benchmarking framework for Visual Geo-localization (VG) that makes it possible to build, train, and test a wide range of commonly used architectures, with the flexibility to change individual components of a geo-localization pipeline. The purpose of this framework is twofold: i) to gain insights into how different components and design choices in a VG pipeline impact the final results, both in terms of performance (recall@N metric) and system requirements (such as execution time and memory consumption); ii) to establish a systematic evaluation protocol for comparing different methods. Using the proposed framework, we perform a large suite of experiments that provide criteria for choosing backbone, aggregation, and negative mining depending on the use case and requirements. We also assess the impact of engineering techniques like pre/post-processing, data augmentation, and image resizing, showing that better performance can be obtained through somewhat simple procedures: for example, downscaling the images' resolution to 80% leads to similar results with a 36% savings in extraction time and dataset storage requirement. Code and trained models are available at https://deep-vg-bench.herokuapp.com/.
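As a rough illustration of the recall@N metric mentioned above, the following minimal sketch assumes the common retrieval convention: a query counts as correctly localized if at least one of its N nearest database images (by descriptor distance) is among its ground-truth positives. The function name `recall_at_n`, the toy descriptors, and the positive sets are hypothetical and are not taken from the benchmark's code.

```python
from typing import Dict, List, Sequence, Set


def recall_at_n(
    query_desc: Sequence[Sequence[float]],
    db_desc: Sequence[Sequence[float]],
    positives: Sequence[Set[int]],
    n_values: Sequence[int] = (1, 5, 10),
) -> Dict[int, float]:
    """Fraction of queries with at least one positive among the top-N results.

    positives[i] holds the database indices regarded as correct matches for
    query i (an assumption: e.g. images taken close to the query's location).
    """
    hits = {n: 0 for n in n_values}
    for q_vec, q_pos in zip(query_desc, positives):
        # Squared Euclidean distance from this query to every database descriptor.
        dists: List[float] = [
            sum((a - b) ** 2 for a, b in zip(q_vec, d_vec)) for d_vec in db_desc
        ]
        # Database indices sorted from nearest to farthest.
        ranking = sorted(range(len(db_desc)), key=dists.__getitem__)
        for n in n_values:
            if q_pos.intersection(ranking[:n]):
                hits[n] += 1
    return {n: hits[n] / len(query_desc) for n in n_values}


# Toy data (illustrative only): 4 database descriptors, 3 queries.
db = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [5.0, 5.0]]
queries = [[0.1, 0.0], [0.9, 0.1], [4.0, 4.0]]
gt_positives = [{0}, {2}, {3}]

recalls = recall_at_n(queries, db, gt_positives, n_values=(1, 2, 3))
print(recalls)
```

In this toy setup the second query only finds its positive at rank 3, so recall rises as N grows. A real evaluation would use descriptors extracted by the trained model and positives derived from ground-truth positions.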


