(Un)reasonable Allure of Ante-hoc Interpretability for High-stakes Domains: Transparency Is Necessary but Insufficient for Explainability

06/04/2023
by   Kacper Sokol, et al.
0

Ante-hoc interpretability has become the holy grail of explainable machine learning for high-stakes domains such as healthcare; however, this notion is elusive, lacks a widely-accepted definition and depends on the deployment context. It can refer to predictive models whose structure adheres to domain-specific constraints, or ones that are inherently transparent. The latter notion assumes observers who judge this quality, whereas the former presupposes them to have technical and domain expertise, in certain cases rendering such models unintelligible. Additionally, its distinction from the less desirable post-hoc explainability, which refers to methods that construct a separate explanatory model, is vague given that transparent predictors may still require (post-)processing to yield satisfactory explanatory insights. Ante-hoc interpretability is thus an overloaded concept that comprises a range of implicit properties, which we unpack in this paper to better understand what is needed for its safe deployment across high-stakes domains. To this end, we outline model- and explainer-specific desiderata that allow us to navigate its distinct realisations in view of the envisaged application and audience.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/29/2021

Explainability Is in the Mind of the Beholder: Establishing the Foundations of Explainable Artificial Intelligence

Explainable artificial intelligence and interpretable machine learning a...
research
12/01/2022

Towards Explainability in Modular Autonomous Vehicle Software

Safety-critical Autonomous Systems require trustworthy and transparent d...
research
10/23/2020

Model Interpretability through the Lens of Computational Complexity

In spite of several claims stating that some models are more interpretab...
research
07/12/2021

Quantifying Explainability in NLP and Analyzing Algorithms for Performance-Explainability Tradeoff

The healthcare domain is one of the most exciting application areas for ...
research
06/29/2018

Posthoc Interpretability of Learning to Rank Models using Secondary Training Data

Predictive models are omnipresent in automated and assisted decision mak...
research
03/12/2023

Branch Learn with Post-hoc Correction for Predict+Optimize with Unknown Parameters in Constraints

Combining machine learning and constrained optimization, Predict+Optimiz...

Please sign up or login with your details

Forgot password? Click here to reset