Diagnostic Curves for Black Box Models

12/02/2019
by   David I. Inouye, et al.
0

In safety-critical applications of machine learning, it is often necessary to look beyond standard metrics such as test accuracy in order to validate various qualitative properties such as monotonicity with respect to a feature or combination of features, checking for undesirable changes or oscillations in the response, and differences in outcomes (e.g. discrimination) for a protected class. To help answer this need, we propose a framework for approximately validating (or invalidating) various properties of a black box model by finding a univariate diagnostic curve in the input space whose output maximally violates a given property. These diagnostic curves show the exact value of the model along the curve and can be displayed with a simple and intuitive line graph. We demonstrate the usefulness of these diagnostic curves across multiple use-cases and datasets including selecting between two models and understanding out-of-sample behavior.

READ FULL TEXT
research
04/16/2023

Explanations of Black-Box Models based on Directional Feature Interactions

As machine learning algorithms are deployed ubiquitously to a variety of...
research
06/26/2018

A Theory of Diagnostic Interpretation in Supervised Classification

Interpretable deep learning is a fundamental building block towards safe...
research
02/23/2016

Auditing Black-box Models for Indirect Influence

Data-trained predictive models see widespread use, but for the most part...
research
10/10/2022

Investigating the Failure Modes of the AUC metric and Exploring Alternatives for Evaluating Systems in Safety Critical Applications

With the increasing importance of safety requirements associated with th...
research
04/08/2023

Counterfactual Explanations of Neural Network-Generated Response Curves

Response curves exhibit the magnitude of the response of a sensitive sys...
research
09/21/2023

Regionally Additive Models: Explainable-by-design models minimizing feature interactions

Generalized Additive Models (GAMs) are widely used explainable-by-design...
research
05/09/2011

Evaluating the diagnostic powers of variables and their linear combinations when the gold standard is continuous

The receiver operating characteristic (ROC) curve is a very useful tool ...

Please sign up or login with your details

Forgot password? Click here to reset