Analyzing the Effects of Classifier Lipschitzness on Explainers

06/24/2022
by   Zulqarnain Khan, et al.
0

Machine learning methods are getting increasingly better at making predictions, but at the same time they are also becoming more complicated and less transparent. As a result, explainers are often relied on to provide interpretability to these black-box prediction models. As crucial diagnostics tools, it is important that these explainers themselves are reliable. In this paper we focus on one particular aspect of reliability, namely that an explainer should give similar explanations for similar data inputs. We formalize this notion by introducing and defining explainer astuteness, analogous to astuteness of classifiers. Our formalism is inspired by the concept of probabilistic Lipschitzness, which captures the probability of local smoothness of a function. For a variety of explainers (e.g., SHAP, RISE, CXPlain), we provide lower bound guarantees on the astuteness of these explainers given the Lipschitzness of the prediction function. These theoretical results imply that locally smooth prediction functions lend themselves to locally robust explanations. We evaluate these results empirically on simulated as well as real datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/19/2022

Black Box Model Explanations and the Human Interpretability Expectations – An Analysis in the Context of Homicide Prediction

Strategies based on Explainable Artificial Intelligence - XAI have promo...
research
04/16/2023

Explanations of Black-Box Models based on Directional Feature Interactions

As machine learning algorithms are deployed ubiquitously to a variety of...
research
11/22/2016

Programs as Black-Box Explanations

Recent work in model-agnostic explanations of black-box machine learning...
research
10/17/2022

RbX: Region-based explanations of prediction models

We introduce region-based explanations (RbX), a novel, model-agnostic me...
research
05/17/2023

Counterfactually Comparing Abstaining Classifiers

Abstaining classifiers have the option to abstain from making prediction...
research
11/02/2020

A Learning Theoretic Perspective on Local Explainability

In this paper, we explore connections between interpretable machine lear...
research
10/05/2021

Unpacking the Black Box: Regulating Algorithmic Decisions

We characterize optimal oversight of algorithms in a world where an agen...

Please sign up or login with your details

Forgot password? Click here to reset