Interpretable Companions for Black-Box Models

02/10/2020
by   Danqing Pan, et al.
0

We present an interpretable companion model for any pre-trained black-box classifiers. The idea is that for any input, a user can decide to either receive a prediction from the black-box model, with high accuracy but no explanations, or employ a companion rule to obtain an interpretable prediction with slightly lower accuracy. The companion model is trained from data and the predictions of the black-box model, with the objective combining area under the transparency–accuracy curve and model complexity. Our model provides flexible choices for practitioners who face the dilemma of choosing between always using interpretable models and always using black-box models for a predictive task, so users can, for any given input, take a step back to resort to an interpretable prediction if they find the predictive performance satisfying, or stick to the black-box model if the rules are unsatisfying. To show the value of companion models, we design a human evaluation on more than a hundred people to investigate the tolerable accuracy loss to gain interpretability for humans.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/10/2019

Hybrid Predictive Model: When an Interpretable Model Collaborates with a Black-box Model

Interpretable machine learning has become a strong competitor for tradit...
research
11/17/2020

Augmented Fairness: An Interpretable Model Augmenting Decision-Makers' Fairness

We propose a model-agnostic approach for mitigating the prediction bias ...
research
07/31/2019

What's in the box? Explaining the black-box model through an evaluation of its interpretable features

Algorithms are powerful and necessary tools behind a large part of the i...
research
12/01/2022

Implicit Mixture of Interpretable Experts for Global and Local Interpretability

We investigate the feasibility of using mixtures of interpretable expert...
research
09/16/2021

Beyond Average Performance – exploring regions of deviating performance for black box classification models

Machine learning models are becoming increasingly popular in different t...
research
11/15/2016

Iterative Orthogonal Feature Projection for Diagnosing Bias in Black-Box Models

Predictive models are increasingly deployed for the purpose of determini...
research
10/31/2019

A study of data and label shift in the LIME framework

LIME is a popular approach for explaining a black-box prediction through...

Please sign up or login with your details

Forgot password? Click here to reset