Interpretable Semantic Photo Geolocalization

04/30/2021
by   Jonas Theiner, et al.
0

Planet-scale photo geolocalization is the complex task of estimating the location depicted in an image solely based on its visual content. Due to the success of convolutional neural networks (CNNs), current approaches achieve super-human performance. However, previous work has exclusively focused on optimizing geolocalization accuracy. Moreover, due to the black-box property of deep learning systems, their predictions are difficult to validate for humans. State-of-the-art methods treat the task as a classification problem, where the choice of the classes, that is the partitioning of the world map, is the key for success. In this paper, we present two contributions in order to improve the interpretability of a geolocalization model: (1) We propose a novel, semantic partitioning method which intuitively leads to an improved understanding of the predictions, while at the same time state-of-the-art results are achieved for geolocational accuracy on benchmark test sets; (2) We introduce a novel metric to assess the importance of semantic visual concepts for a certain prediction to provide additional interpretable information, which allows for a large-scale analysis of already trained models.

READ FULL TEXT

page 3

page 7

research
07/30/2021

Creating Powerful and Interpretable Models withRegression Networks

As the discipline has evolved, research in machine learning has been foc...
research
02/02/2018

Visual Interpretability for Deep Learning: a Survey

This paper reviews recent studies in emerging directions of understandin...
research
08/21/2017

More cat than cute? Interpretable Prediction of Adjective-Noun Pairs

The increasing availability of affect-rich multimedia resources has bols...
research
05/22/2022

Learnable Visual Words for Interpretable Image Recognition

To interpret deep models' predictions, attention-based visual cues are w...
research
01/05/2018

Efficient Image Evidence Analysis of CNN Classification Results

Convolutional neural networks (CNNs) define the current state-of-the-art...
research
01/20/2023

Image Memorability Prediction with Vision Transformers

Behavioral studies have shown that the memorability of images is similar...
research
08/06/2018

CPlaNet: Enhancing Image Geolocalization by Combinatorial Partitioning of Maps

Image geolocalization is the task of identifying the location depicted i...

Please sign up or login with your details

Forgot password? Click here to reset