Interpretability Benchmark for Evaluating Spatial Misalignment of Prototypical Parts Explanations

08/16/2023
by   Mikołaj Sacha, et al.
0

Prototypical parts-based networks are becoming increasingly popular due to their faithful self-explanations. However, their similarity maps are calculated in the penultimate network layer. Therefore, the receptive field of the prototype activation region often depends on parts of the image outside this region, which can lead to misleading interpretations. We name this undesired behavior a spatial explanation misalignment and introduce an interpretability benchmark with a set of dedicated metrics for quantifying this phenomenon. In addition, we propose a method for misalignment compensation and apply it to existing state-of-the-art models. We show the expressiveness of our benchmark and the effectiveness of the proposed compensation methodology through extensive empirical studies.

READ FULL TEXT

page 1

page 3

page 4

page 5

page 7

page 12

page 13

page 14

research
06/21/2018

On the Robustness of Interpretability Methods

We argue that robustness of explanations---i.e., that similar inputs sho...
research
06/09/2021

On Sample Based Explanation Methods for NLP:Efficiency, Faithfulness, and Semantic Evaluation

In the recent advances of natural language processing, the scale of the ...
research
12/07/2022

Learning to Select Prototypical Parts for Interpretable Sequential Data Modeling

Prototype-based interpretability methods provide intuitive explanations ...
research
06/27/2022

RES: A Robust Framework for Guiding Visual Explanation

Despite the fast progress of explanation techniques in modern Deep Neura...
research
04/13/2023

Evaluating the Robustness of Interpretability Methods through Explanation Invariance and Equivariance

Interpretability methods are valuable only if their explanations faithfu...
research
06/10/2020

Why is Attention Not So Attentive?

Attention-based methods have played an important role in model interpret...
research
11/29/2020

ProtoPShare: Prototype Sharing for Interpretable Image Classification and Similarity Discovery

In this paper, we introduce ProtoPShare, a self-explained method that in...

Please sign up or login with your details

Forgot password? Click here to reset