Explainability Is in the Mind of the Beholder: Establishing the Foundations of Explainable Artificial Intelligence

12/29/2021
by   Kacper Sokol, et al.
19

Explainable artificial intelligence and interpretable machine learning are research fields growing in importance. Yet, the underlying concepts remain somewhat elusive and lack generally agreed definitions. While recent inspiration from social sciences has refocused the work on needs and expectations of human recipients, the field still misses a concrete conceptualisation. We take steps towards addressing this challenge by reviewing the philosophical and social foundations of human explainability, which we then translate into the technological realm. In particular, we scrutinise the notion of algorithmic black boxes and the spectrum of understanding determined by explanatory processes and explainees' background knowledge. This approach allows us to define explainability as (logical) reasoning applied to transparent insights (into black boxes) interpreted under certain background knowledge - a process that engenders understanding in explainees. We then employ this conceptualisation to revisit the much disputed trade-off between transparency and predictive power and its implications for ante-hoc and post-hoc explainers as well as fairness and accountability engendered by explainability. We furthermore discuss components of the machine learning workflow that may be in need of interpretability, building on a range of ideas from human-centred explainability, with a focus on explainees, contrastive statements and explanatory processes. Our discussion reconciles and complements current research to help better navigate open questions - rather than attempting to address any individual issue - thus laying a solid foundation for a grounded discussion and future progress of explainable artificial intelligence and interpretable machine learning. We conclude with a summary of our findings, revisiting the human-centred explanatory process needed to achieve the desired level of algorithmic transparency.

READ FULL TEXT
research
07/01/2023

The future of human-centric eXplainable Artificial Intelligence (XAI) is not post-hoc explanations

Explainable Artificial Intelligence (XAI) plays a crucial role in enabli...
research
09/08/2022

What and How of Machine Learning Transparency: Building Bespoke Explainability Tools with Interoperable Algorithmic Components

Explainability techniques for data-driven predictive models based on art...
research
06/04/2023

(Un)reasonable Allure of Ante-hoc Interpretability for High-stakes Domains: Transparency Is Necessary but Insufficient for Explainability

Ante-hoc interpretability has become the holy grail of explainable machi...
research
07/26/2023

Revisiting the Performance-Explainability Trade-Off in Explainable Artificial Intelligence (XAI)

Within the field of Requirements Engineering (RE), the increasing signif...
research
03/02/2022

Satellite Image and Machine Learning based Knowledge Extraction in the Poverty and Welfare Domain

Recent advances in artificial intelligence and machine learning have cre...
research
06/03/2019

Kandinsky Patterns

Kandinsky Figures and Kandinsky Patterns are mathematically describable,...
research
11/01/2022

Evaluation Metrics for Symbolic Knowledge Extracted from Machine Learning Black Boxes: A Discussion Paper

As opaque decision systems are being increasingly adopted in almost any ...

Please sign up or login with your details

Forgot password? Click here to reset