Model-Agnostic Interpretable and Data-driven suRRogates suited for highly regulated industries

07/14/2020
by   Roel Henckaerts, et al.
0

Highly regulated industries, like banking and insurance, ask for transparent decision-making algorithms. At the same time, competitive markets push for sophisticated black box models. We therefore present a procedure to develop a Model-Agnostic Interpretable Data-driven suRRogate, suited for structured tabular data. Insights are extracted from a black box via partial dependence effects. These are used to group feature values, resulting in a segmentation of the feature space with automatic feature selection. A transparent generalized linear model (GLM) is fit to the features in categorical format and their relevant interactions. We demonstrate our R package maidrr with a case study on general insurance claim frequency modeling for six public datasets. Our maidrr GLM closely approximates a gradient boosting machine (GBM) and outperforms both a linear and tree surrogate as benchmarks.

READ FULL TEXT

page 9

page 12

research
11/17/2020

Augmented Fairness: An Interpretable Model Augmenting Decision-Makers' Fairness

We propose a model-agnostic approach for mitigating the prediction bias ...
research
10/29/2019

bLIMEy: Surrogate Prediction Explanations Beyond LIME

Surrogate explainers of black-box machine learning predictions are of pa...
research
06/04/2019

Concept Tree: High-Level Representation of Variables for More Interpretable Surrogate Decision Trees

Interpretable surrogates of black-box predictors trained on high-dimensi...
research
06/27/2022

Thermodynamics of Interpretation

Over the past few years, different types of data-driven Artificial Intel...
research
09/12/2021

Automatic Componentwise Boosting: An Interpretable AutoML System

In practice, machine learning (ML) workflows require various different s...
research
07/15/2020

VAE-LIME: Deep Generative Model Based Approach for Local Data-Driven Model Interpretability Applied to the Ironmaking Industry

Machine learning applied to generate data-driven models are lacking of t...
research
08/05/2022

Black box approximation in the tensor train format initialized by ANOVA decomposition

Surrogate models can reduce computational costs for multivariable functi...

Please sign up or login with your details

Forgot password? Click here to reset