Explaining Predictions by Approximating the Local Decision Boundary

06/14/2020
by Georgios Vlassopoulos, et al.

Constructing accurate model-agnostic explanations for opaque machine learning models remains a challenging task. Classification models for high-dimensional data, such as images, are often complex and highly parameterized. To reduce this complexity, various authors attempt to explain individual predictions locally, either in terms of a simpler local surrogate model or by communicating how the predictions contrast with those of another class. However, existing approaches still fall short in at least one of the following ways: (a) they measure locality using a (Euclidean) metric that is not meaningful for non-linear high-dimensional data; (b) they do not attempt to explain the decision boundary, which is the most relevant characteristic of classifiers that are optimized for classification accuracy; or (c) they do not give users any freedom to specify attributes that are meaningful to them. We address these issues with a new procedure for local decision boundary approximation (DBA). To construct a meaningful metric, we train a variational autoencoder to learn a Euclidean latent space of encoded data representations. We impose interpretability by exploiting attribute annotations to map the latent space to attributes that are meaningful to the user. A difficulty in evaluating explainability approaches is the lack of a ground truth. We address this by introducing a new benchmark data set of artificially generated Iris images and showing that we can recover the latent attributes that locally determine the class. We further evaluate our approach on the CelebA image data set.
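
The general idea of approximating a decision boundary locally in a latent space can be illustrated with a minimal sketch. This is not the authors' DBA procedure: it assumes a toy 2-D space standing in for a learned VAE latent space, a hypothetical `black_box` classifier, and a plain logistic-regression surrogate fitted by gradient descent on Gaussian neighbours of a point `z0`. All names and parameter values are illustrative assumptions.

```python
import numpy as np


def black_box(z):
    """Hypothetical opaque classifier on 2-D latent codes (stand-in for a real model)."""
    return (z[:, 0] ** 2 + z[:, 1] > 1.0).astype(float)


def fit_local_surrogate(z0, predict, n_samples=2000, scale=0.3,
                        steps=500, lr=0.5, seed=0):
    """Fit a logistic-regression surrogate to `predict` around the latent point z0."""
    rng = np.random.default_rng(seed)
    # Sample neighbours of z0 with isotropic Gaussian noise, i.e. locality
    # is measured with the Euclidean metric of the (assumed) latent space.
    x = scale * rng.standard_normal((n_samples, z0.shape[0]))
    y = predict(z0 + x)
    w = np.zeros(z0.shape[0])
    b = 0.0
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-(x @ w + b)))   # surrogate probabilities
        w -= lr * (x.T @ (p - y)) / n_samples     # gradient of the log-loss w.r.t. w
        b -= lr * np.mean(p - y)                  # gradient of the log-loss w.r.t. b
    return w, b


# A point close to the toy boundary z_1^2 + z_2 = 1 (coordinates are assumptions).
z0 = np.array([1.0, 0.1])
w, b = fit_local_surrogate(z0, black_box)

# Check how often the linear surrogate agrees with the black box near z0.
rng = np.random.default_rng(1)
x_test = 0.3 * rng.standard_normal((1000, 2))
surrogate_labels = (x_test @ w + b > 0).astype(float)
agreement = np.mean(surrogate_labels == black_box(z0 + x_test))
print("boundary normal (unnormalised):", w)
print("local agreement with black box:", agreement)
```

The weight vector `w` plays the role of a normal to the approximated local boundary: its largest components indicate which latent directions most affect the class near `z0`. In a setting like the one the abstract describes, those directions would additionally need to be mapped to user-meaningful attributes, which this sketch does not attempt.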

research
10/14/2020

Human-interpretable model explainability on high-dimensional data

The importance of explainability in machine learning continues to grow, ...
research
08/12/2019

Variational Autoencoded Regression: High Dimensional Regression of Visual Data on Complex Manifold

This paper proposes a new high dimensional regression method by merging ...
research
09/16/2020

Analysis of Generalizability of Deep Neural Networks Based on the Complexity of Decision Boundary

For supervised learning models, the analysis of generalization ability (...
research
10/28/2021

Explaining Latent Representations with a Corpus of Examples

Modern machine learning models are complicated. Most of them rely on con...
research
09/30/2021

XPROAX-Local explanations for text classification with progressive neighborhood approximation

The importance of the neighborhood for training a local surrogate model ...
research
05/23/2023

Physics-Assisted Reduced-Order Modeling for Identifying Dominant Features of Transonic Buffet

Transonic buffet is a flow instability phenomenon that arises from the i...
research
06/05/2019

Interpretable and Differentially Private Predictions

Interpretable predictions, where it is clear why a machine learning mode...
