Streamlining models with explanations in the learning loop

02/15/2023
by   Francesco Lomuscio, et al.
0

Several explainable AI methods allow a Machine Learning user to get insights on the classification process of a black-box model in the form of local linear explanations. With such information, the user can judge which features are locally relevant for the classification outcome, and get an understanding of how the model reasons. Standard supervised learning processes are purely driven by the original features and target labels, without any feedback loop informed by the local relevance of the features identified by the post-hoc explanations. In this paper, we exploit this newly obtained information to design a feature engineering phase, where we combine explanations with feature values. To do so, we develop two different strategies, named Iterative Dataset Weighting and Targeted Replacement Values, which generate streamlined models that better mimic the explanation process presented to the user. We show how these streamlined models compare to the original black-box classifiers, in terms of accuracy and compactness of the newly produced explanations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/05/2020

Post-hoc explanation of black-box classifiers using confident itemsets

It is difficult to trust decisions made by Black-box Artificial Intellig...
research
06/24/2021

What will it take to generate fairness-preserving explanations?

In situations where explanations of black-box models may be useful, the ...
research
12/08/2022

XRand: Differentially Private Defense against Explanation-Guided Attacks

Recent development in the field of explainable artificial intelligence (...
research
08/27/2021

This looks more like that: Enhancing Self-Explaining Models by Prototypical Relevance Propagation

Current machine learning models have shown high efficiency in solving a ...
research
10/17/2022

RbX: Region-based explanations of prediction models

We introduce region-based explanations (RbX), a novel, model-agnostic me...
research
09/08/2022

From Shapley Values to Generalized Additive Models and back

In explainable machine learning, local post-hoc explanation algorithms a...
research
03/20/2022

Explainable Misinformation Detection Across Multiple Social Media Platforms

In this work, the integration of two machine learning approaches, namely...

Please sign up or login with your details

Forgot password? Click here to reset