A Comparative Study of Faithfulness Metrics for Model Interpretability Methods

04/12/2022
by Chun Sik Chan, et al.

Interpretation methods to reveal the internal reasoning processes behind machine learning models have attracted increasing attention in recent years. To quantify the extent to which the identified interpretations truly reflect the intrinsic decision-making mechanisms, various faithfulness evaluation metrics have been proposed. However, we find that different faithfulness metrics show conflicting preferences when comparing different interpretations. Motivated by this observation, we aim to conduct a comprehensive and comparative study of the widely adopted faithfulness metrics. In particular, we introduce two assessment dimensions, namely diagnosticity and time complexity. Diagnosticity refers to the degree to which the faithfulness metric favours relatively faithful interpretations over randomly generated ones, and time complexity is measured by the average number of model forward passes. According to the experimental results, we find that sufficiency and comprehensiveness metrics have higher diagnosticity and lower time complexity than the other faithfulness metric
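The two assessment dimensions described above can be illustrated with a small sketch. It is not the authors' implementation: the toy bag-of-words model, the token weights, the hand-picked stand-ins for random rationales, and every function name are hypothetical. It also assumes the common erasure-based definitions, where comprehensiveness is the drop in predicted probability after removing the rationale tokens and sufficiency is the drop when only the rationale tokens are kept.

```python
import math


class CountingModel:
    """Toy bag-of-words classifier (hypothetical) that counts forward passes."""

    def __init__(self, weights):
        self.weights = weights
        self.forward_passes = 0

    def predict_proba(self, tokens):
        # Each call stands in for one model forward pass.
        self.forward_passes += 1
        score = sum(self.weights.get(t, 0.0) for t in tokens)
        return 1.0 / (1.0 + math.exp(-score))


def comprehensiveness(model, tokens, rationale):
    # Assumed definition: p(full input) - p(input with rationale removed).
    # Higher values suggest a more faithful rationale.
    full = model.predict_proba(tokens)
    reduced = model.predict_proba([t for t in tokens if t not in rationale])
    return full - reduced


def sufficiency(model, tokens, rationale):
    # Assumed definition: p(full input) - p(rationale tokens only).
    # Lower values suggest a more faithful rationale.
    full = model.predict_proba(tokens)
    only = model.predict_proba([t for t in tokens if t in rationale])
    return full - only


def diagnosticity(metric, model, tokens, pairs, higher_is_better):
    # Fraction of (more faithful, random) pairs in which the metric
    # prefers the more faithful rationale.
    wins = 0
    for better, worse in pairs:
        a = metric(model, tokens, better)
        b = metric(model, tokens, worse)
        if (a > b) if higher_is_better else (a < b):
            wins += 1
    return wins / len(pairs)


# Illustrative data: only "great" and "fun" influence the toy model.
weights = {"great": 2.0, "fun": 1.0}
tokens = ["the", "movie", "was", "great", "fun"]
faithful = {"great", "fun"}
# Hand-picked stand-ins for randomly generated rationales.
random_like = [{"the", "movie"}, {"movie", "was"}, {"the", "great"}]
pairs = [(faithful, r) for r in random_like]

model = CountingModel(weights)
diag_comp = diagnosticity(comprehensiveness, model, tokens, pairs, True)
diag_suff = diagnosticity(sufficiency, model, tokens, pairs, False)

# Two metrics, each evaluated on both rationales of every pair.
evaluations = 2 * 2 * len(pairs)
passes_per_eval = model.forward_passes / evaluations

print(diag_comp, diag_suff, passes_per_eval)
```

On this toy data both metrics always prefer the hand-built faithful rationale, and each metric evaluation costs two forward passes. Metrics that sweep over many perturbation levels would need more passes per evaluation, which is the cost the time-complexity dimension is meant to capture.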


