A psychological theory of explainability

05/17/2022
by   Scott Cheng-Hsin Yang, et al.
11

The goal of explainable Artificial Intelligence (XAI) is to generate human-interpretable explanations, but there are no computationally precise theories of how humans interpret AI generated explanations. The lack of theory means that validation of XAI must be done empirically, on a case-by-case basis, which prevents systematic theory-building in XAI. We propose a psychological theory of how humans draw conclusions from saliency maps, the most common form of XAI explanation, which for the first time allows for precise prediction of explainee inference conditioned on explanation. Our theory posits that absent explanation humans expect the AI to make similar decisions to themselves, and that they interpret an explanation by comparison to the explanations they themselves would give. Comparison is formalized via Shepard's universal law of generalization in a similarity space, a classic theory from cognitive science. A pre-registered user study on AI image classifications with saliency map explanations demonstrate that our theory quantitatively matches participants' predictions of the AI.

READ FULL TEXT

page 1

page 2

page 3

page 6

page 7

page 8

page 9

page 11

research
02/03/2020

Evaluating Saliency Map Explanations for Convolutional Neural Networks: A User Study

Convolutional neural networks (CNNs) offer great machine learning perfor...
research
06/20/2022

Eliminating The Impossible, Whatever Remains Must Be True

The rise of AI methods to make predictions and decisions has led to a pr...
research
08/14/2023

BSED: Baseline Shapley-Based Explainable Detector

Explainable artificial intelligence (XAI) has witnessed significant adva...
research
05/17/2022

Is explainable AI a race against model complexity?

Explaining the behaviour of intelligent systems will get increasingly an...
research
03/12/2021

Explainable AI by BAPC – Before and After correction Parameter Comparison

By means of a local surrogate approach, an analytical method to yield ex...
research
04/10/2023

Explanation Strategies for Image Classification in Humans vs. Current Explainable AI

Explainable AI (XAI) methods provide explanations of AI models, but our ...
research
07/31/2021

Towards explainable artificial intelligence (XAI) for early anticipation of traffic accidents

Traffic accident anticipation is a vital function of Automated Driving S...

Please sign up or login with your details

Forgot password? Click here to reset