A Generalized Variable Importance Metric and Estimator for Black Box Machine Learning Models

The aim of this study is to define importance of predictors for black box machine learning methods, where the prediction function can be highly non-additive and cannot be represented by statistical parameters. In this paper we defined a “Generalized Variable Importance Metric (GVIM)” using the true conditional expectation function for a continuous or a binary response variable. We further showed that the defined GVIM can be represented as a function of the Conditional Average Treatment Effect (CATE) squared for multinomial and continuous predictors. Then we propose how the metric can be estimated using using any machine learning models. Finally we showed the properties of the estimator using multiple simulations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/01/2019

Explaining black box decisions by Shapley cohort refinement

We introduce a variable importance measure to explain the importance of ...
research
08/26/2019

Shapley Decomposition of R-Squared in Machine Learning Models

In this paper we introduce a metric aimed at helping machine learning pr...
research
11/21/2021

Decorrelated Variable Importance

Because of the widespread use of black box prediction methods such as ra...
research
04/15/2023

The XAISuite framework and the implications of explanatory system dissonance

Explanatory systems make machine learning models more transparent. Howev...
research
02/11/2018

Global Model Interpretation via Recursive Partitioning

In this work, we propose a simple but effective method to interpret blac...
research
01/04/2021

Gaussian Function On Response Surface Estimation

We propose a new framework for 2-D interpreting (features and samples) b...

Please sign up or login with your details

Forgot password? Click here to reset