Exact Shapley Values for Local and Model-True Explanations of Decision Tree Ensembles

12/16/2021
by Thomas W. Campbell, et al.

Additive feature explanations based on Shapley values have become popular for showing how much each feature contributes to an individual prediction of a machine learning model. Although Shapley values give a unique additive feature attribution in cooperative game theory, the Shapley values that can be generated for even a single machine learning model are far from unique: theoretical and implementation decisions both affect the resulting attributions. Here, we consider the application of Shapley values to explaining decision tree ensembles and present a novel approach to Shapley value-based feature attribution that can be applied to random forests and boosted decision trees. The new method provides attributions that accurately reflect details of the model's prediction algorithm for individual instances, while remaining computationally competitive with one of the most widely used current methods. We explain the theoretical differences between the standard and novel approaches and compare their performance on synthetic and real data.
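To make the non-uniqueness point concrete, the following is a minimal brute-force sketch of exact Shapley values for a toy tree-like model. It is not the method proposed in the paper. The names `toy_model`, `make_value`, and `background` are hypothetical, and the value function used here (averaging the model over background rows for the features outside the coalition) is only one of several possible choices; a different choice would generally yield different attributions for the same model and instance.

```python
from itertools import combinations
from math import factorial


def shapley_values(value, n):
    """Exact Shapley values for an n-player game with set function `value`.

    Enumerates every coalition, so the cost grows as 2**n; suitable only
    for illustration on a handful of features.
    """
    phi = [0.0] * n
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for size in range(n):
            # Classic Shapley weight |S|! (n - |S| - 1)! / n!
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                s = frozenset(subset)
                phi[i] += weight * (value(s | {i}) - value(s))
    return phi


def toy_model(x):
    # Hypothetical stand-in for a single depth-2 decision tree.
    if x[0] > 0.5:
        return 10.0 if x[1] > 0.5 else 4.0
    return 1.0


def make_value(model, x, background):
    """Assumed value function: features in the coalition are taken from x,
    the remaining features from each background row, then averaged."""
    def value(subset):
        total = 0.0
        for row in background:
            mixed = [x[j] if j in subset else row[j] for j in range(len(x))]
            total += model(mixed)
        return total / len(background)
    return value


x = [1.0, 1.0]  # instance to explain
background = [[0.0, 0.0], [0.0, 1.0], [1.0, 0.0], [1.0, 1.0]]
v = make_value(toy_model, x, background)
phi = shapley_values(v, len(x))

# Additivity: attributions sum to prediction minus the baseline v(empty set).
baseline = v(frozenset())
print(phi, sum(phi), toy_model(x) - baseline)
```

The attributions sum to the difference between the prediction for `x` and the baseline, which is the additive property the abstract refers to. Swapping in a different `background` set or a different value function changes `phi` without changing the model, which illustrates why the definition matters.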

research
06/19/2017

Consistent feature attribution for tree ensembles

It is critical in many applications to understand what features are impo...
research
10/18/2021

RKHS-SHAP: Shapley Values for Kernel Methods

Feature attribution for kernel methods is often heuristic and not indivi...
research
09/29/2022

Understanding Interventional TreeSHAP : How and Why it Works

Shapley values are ubiquitous in interpretable Machine Learning due to t...
research
02/12/2018

Consistent Individualized Feature Attribution for Tree Ensembles

Interpreting predictions from tree ensemble methods such as gradient boo...
research
06/29/2020

True to the Model or True to the Data?

A variety of recent papers discuss the application of Shapley values, a ...
research
11/21/2022

Explaining Random Forests using Bipolar Argumentation and Markov Networks (Technical Report)

Random forests are decision tree ensembles that can be used to solve a v...
research
10/27/2020

GPUTreeShap: Fast Parallel Tree Interpretability

SHAP (SHapley Additive exPlanation) values provide a game theoretic inte...
