Truthful Meta-Explanations for Local Interpretability of Machine Learning Models

12/07/2022
by   Ioannis Mollas, et al.

The integration of automated, machine learning (ML)-based systems into a wide range of tasks has expanded as a result of their performance and speed. Despite the numerous advantages of ML-based systems, they should not be used in critical, high-risk applications where human lives are at stake unless they are interpretable. To address this issue, researchers and businesses have focused on improving the interpretability of complex ML systems, and several such methods have been developed. Indeed, so many techniques now exist that practitioners find it difficult to choose the best one for their application, even with the help of evaluation metrics. As a result, there is a clear demand for a selection tool: a meta-explanation technique based on a high-quality evaluation metric. In this paper, we present a local meta-explanation technique that builds on the truthfulness metric, a faithfulness-based metric. We demonstrate the effectiveness of both the technique and the metric by concretely defining all the concepts involved and through experimentation.


research
10/15/2020

Altruist: Argumentative Explanations through Local Interpretations of Predictive Models

Interpretable machine learning is an emerging field providing solutions ...
research
02/14/2023

The Meta-Evaluation Problem in Explainable AI: Identifying Reliable Estimators with MetaQuantus

Explainable AI (XAI) is a rapidly evolving field that aims to improve tr...
research
05/24/2022

Interpretation Quality Score for Measuring the Quality of interpretability methods

Machine learning (ML) models have been applied to a wide range of natura...
research
11/10/2021

A Meta-Method for Portfolio Management Using Machine Learning for Adaptive Strategy Selection

This work proposes a novel portfolio management technique, the Meta Port...
research
09/18/2020

Evaluation of Local Explanation Methods for Multivariate Time Series Forecasting

Being able to interpret a machine learning model is a crucial task in ma...
research
01/14/2019

Interpretable machine learning: definitions, methods, and applications

Machine-learning models have demonstrated great success in learning comp...
research
08/02/2022

ferret: a Framework for Benchmarking Explainers on Transformers

Many interpretability tools allow practitioners and researchers to expla...
