Deep Visual City Recognition Visualization

05/06/2019
by   Xiangwei Shi, et al.
24

Understanding how cities visually differ from each others is interesting for planners, residents, and historians. We investigate the interpretation of deep features learned by convolutional neural networks (CNNs) for city recognition. Given a trained city recognition network, we first generate weighted masks using the known Grad-CAM technique and to select the most discriminate regions in the image. Since the image classification label is the city name, it contains no information of objects that are class-discriminate, we investigate the interpretability of deep representations with two methods. (i) Unsupervised method is used to cluster the objects appearing in the visual explanations. (ii) A pretrained semantic segmentation model is used to label objects in pixel level, and then we introduce statistical measures to quantitatively evaluate the interpretability of discriminate objects. The influence of network architectures and random initializations in training, is studied on the interpretability of CNN features for city recognition. The results suggest that network architectures would affect the interpretability of learned visual representations greater than different initializations.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

research
11/15/2017

Interpreting Deep Visual Representations via Network Dissection

The success of recent deep convolutional neural networks (CNNs) depends ...
research
01/27/2018

Understanding Deep Architectures by Interpretable Visual Summaries

A consistent body of research investigates the recurrent visual patterns...
research
02/02/2018

Visual Interpretability for Deep Learning: a Survey

This paper reviews recent studies in emerging directions of understandin...
research
12/31/2021

PCACE: A Statistical Approach to Ranking Neurons for CNN Interpretability

In this paper we introduce a new problem within the growing literature o...
research
07/23/2023

EnTri: Ensemble Learning with Tri-level Representations for Explainable Scene Recognition

Scene recognition based on deep-learning has made significant progress, ...
research
02/19/2023

TAX: Tendency-and-Assignment Explainer for Semantic Segmentation with Multi-Annotators

To understand how deep neural networks perform classification prediction...
research
07/10/2017

An Analysis of Human-centered Geolocation

Online social networks contain a constantly increasing amount of images ...

Please sign up or login with your details

Forgot password? Click here to reset