Towards Interrogating Discriminative Machine Learning Models

05/23/2017
by   Wenbo Guo, et al.
0

It is oftentimes impossible to understand how machine learning models reach a decision. While recent research has proposed various technical approaches to provide some clues as to how a learning model makes individual decisions, they cannot provide users with ability to inspect a learning model as a complete entity. In this work, we propose a new technical approach that augments a Bayesian regression mixture model with multiple elastic nets. Using the enhanced mixture model, we extract explanations for a target model through global approximation. To demonstrate the utility of our approach, we evaluate it on different learning models covering the tasks of text mining and image recognition. Our results indicate that the proposed approach not only outperforms the state-of-the-art technique in explaining individual decisions but also provides users with an ability to discover the vulnerabilities of a learning model.

READ FULL TEXT

page 6

page 7

research
11/07/2018

Explaining Deep Learning Models - A Bayesian Non-parametric Approach

Understanding and interpreting how machine learning (ML) models make dec...
research
09/21/2023

Predictability and Comprehensibility in Post-Hoc XAI Methods: A User-Centered Analysis

Post-hoc explainability methods aim to clarify predictions of black-box ...
research
09/02/2019

Understanding Bias in Machine Learning

Bias is known to be an impediment to fair decisions in many domains such...
research
06/01/2023

Adversarial-Aware Deep Learning System based on a Secondary Classical Machine Learning Verification Approach

Deep learning models have been used in creating various effective image ...
research
06/04/2021

A Holistic Approach to Interpretability in Financial Lending: Models, Visualizations, and Summary-Explanations

Lending decisions are usually made with proprietary models that provide ...
research
07/20/2023

Prediction of Handball Matches with Statistically Enhanced Learning via Estimated Team Strengths

We propose a Statistically Enhanced Learning (aka. SEL) model to predict...

Please sign up or login with your details

Forgot password? Click here to reset