SpArX: Sparse Argumentative Explanations for Neural Networks

01/23/2023
by   Hamed Ayoobi, et al.
0

Neural networks (NNs) have various applications in AI, but explaining their decision process remains challenging. Existing approaches often focus on explaining how changing individual inputs affects NNs' outputs. However, an explanation that is consistent with the input-output behaviour of an NN is not necessarily faithful to the actual mechanics thereof. In this paper, we exploit relationships between multi-layer perceptrons (MLPs) and quantitative argumentation frameworks (QAFs) to create argumentative explanations for the mechanics of MLPs. Our SpArX method first sparsifies the MLP while maintaining as much of the original mechanics as possible. It then translates the sparse MLP into an equivalent QAF to shed light on the underlying decision process of the MLP, producing global and/or local explanations. We demonstrate experimentally that SpArX can give more faithful explanations than existing approaches, while simultaneously providing deeper insights into the actual reasoning process of MLPs.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/21/2022

Explaining Random Forests using Bipolar Argumentation and Markov Networks (Technical Report)

Random forests are decision tree ensembles that can be used to solve a v...
research
12/10/2020

DAX: Deep Argumentative eXplanation for Neural Networks

Despite the rapid growth in attention on eXplainable AI (XAI) of late, e...
research
12/18/2020

Towards Robust Explanations for Deep Neural Networks

Explanation methods shed light on the decision process of black-box clas...
research
06/16/2021

Best of both worlds: local and global explanations with human-understandable concepts

Interpretability techniques aim to provide the rationale behind a model'...
research
05/14/2020

Distilling neural networks into skipgram-level decision lists

Several previous studies on explanation for recurrent neural networks fo...
research
07/25/2023

Argument Attribution Explanations in Quantitative Bipolar Argumentation Frameworks (Technical Report)

Argumentative explainable AI has been advocated by several in recent yea...
research
02/12/2020

Self-explaining AI as an alternative to interpretable AI

The ability to explain decisions made by AI systems is highly sought aft...

Please sign up or login with your details

Forgot password? Click here to reset