BELLA: Black box model Explanations by Local Linear Approximations

05/18/2023
by   Nedeljko Radulovic, et al.
0

In recent years, understanding the decision-making process of black-box models has become not only a legal requirement but also an additional way to assess their performance. However, the state of the art post-hoc interpretation approaches rely on synthetic data generation. This introduces uncertainty and can hurt the reliability of the interpretations. Furthermore, they tend to produce explanations that apply to only very few data points. This makes the explanations brittle and limited in scope. Finally, they provide scores that have no direct verifiable meaning. In this paper, we present BELLA, a deterministic model-agnostic post-hoc approach for explaining the individual predictions of regression black-box models. BELLA provides explanations in the form of a linear model trained in the feature space. Thus, its coefficients can be used directly to compute the predicted value from the feature values. Furthermore, BELLA maximizes the size of the neighborhood to which the linear model applies, so that the explanations are accurate, simple, general, and robust. BELLA can produce both factual and counterfactual explanations. Our user study confirms the importance of the desiderata we optimize, and our experiments show that BELLA outperforms the state-of-the-art approaches on these desiderata.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/05/2019

Global Aggregations of Local Explanations for Black Box models

The decision-making process of many state-of-the-art machine learning mo...
research
02/02/2022

Analogies and Feature Attributions for Model Agnostic Explanation of Similarity Learners

Post-hoc explanations for black box models have been studied extensively...
research
09/11/2020

Accurate and Intuitive Contextual Explanations using Linear Model Trees

With the ever-increasing use of complex machine learning models in criti...
research
05/30/2022

Fooling SHAP with Stealthily Biased Sampling

SHAP explanations aim at identifying which features contribute the most ...
research
10/01/2021

Consistent Explanations by Contrastive Learning

Understanding and explaining the decisions of neural networks are critic...
research
04/29/2019

Why should you trust my interpretation? Understanding uncertainty in LIME predictions

Methods for interpreting machine learning black-box models increase the ...
research
06/27/2018

Piecewise Approximations of Black Box Models for Model Interpretation

Machine Learning models have proved extremely successful for a wide vari...

Please sign up or login with your details

Forgot password? Click here to reset