GPUTreeShap: Fast Parallel Tree Interpretability

10/27/2020
by   Rory Mitchell, et al.
0

SHAP (SHapley Additive exPlanation) values provide a game theoretic interpretation of the predictions of machine learning models based on Shapley values. While SHAP values are intractable in general, a recursive polynomial time algorithm specialised for decision tree models is available, named TreeShap. Despite its polynomial time complexity, TreeShap can become a significant bottleneck in practical machine learning pipelines when applied to large decision tree ensembles. We present GPUTreeShap, a software package implementing a modified TreeShap algorithm in CUDA for Nvidia GPUs. Our approach first preprocesses the input model to isolate variable sized sub-problems from the original recursive algorithm, then solves a bin packing problem, and finally maps sub-problems to streaming multiprocessors for parallel execution with specialised hardware instructions. With a single GPU, we achieve speedups of up to 19x for SHAP values, and 340x for SHAP interaction values, over a state-of-the-art multi-core CPU implementation. We also experiment with an 8 GPU DGX-1 system, demonstrating throughput of 1.2M rows per second—equivalent CPU-based performance is estimated to require 6850 CPU cores.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/01/2022

Optimization of Oblivious Decision Tree Ensembles Evaluation for CPU

CatBoost is a popular machine learning library. CatBoost models are base...
research
05/05/2023

Verifiable Learning for Robust Tree Ensembles

Verifying the robustness of machine learning models against evasion atta...
research
09/20/2021

Fast TreeSHAP: Accelerating SHAP Value Computation for Trees

SHAP (SHapley Additive exPlanation) values are one of the leading tools ...
research
12/16/2021

Exact Shapley Values for Local and Model-True Explanations of Decision Tree Ensembles

Additive feature explanations using Shapley values have become popular f...
research
09/16/2022

Linear TreeShap

Decision trees are well-known due to their ease of interpretability. To ...
research
01/30/2018

Indistinguishable binomial decision tree of 3-SAT: Proof of class P is a proper subset of class NP

This paper solves a long standing open problem of whether NP-complete pr...
research
10/23/2020

Optimized Multivariate Polynomial Determinant on GPU

We present an optimized algorithm calculating determinant for multivaria...

Please sign up or login with your details

Forgot password? Click here to reset