A Multilinear Sampling Algorithm to Estimate Shapley Values

10/22/2020
by   Ramin Okhrati, et al.
0

Shapley values are great analytical tools in game theory to measure the importance of a player in a game. Due to their axiomatic and desirable properties such as efficiency, they have become popular for feature importance analysis in data science and machine learning. However, the time complexity to compute Shapley values based on the original formula is exponential, and as the number of features increases, this becomes infeasible. Castro et al. [1] developed a sampling algorithm, to estimate Shapley values. In this work, we propose a new sampling method based on a multilinear extension technique as applied in game theory. The aim is to provide a more efficient (sampling) method for estimating Shapley values. Our method is applicable to any machine learning model, in particular for either multi-class classifications or regression problems. We apply the method to estimate Shapley values for multilayer perceptrons (MLPs) and through experimentation on two datasets, we demonstrate that our method provides more accurate estimations of the Shapley values by reducing the variance of the sampling statistics.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/25/2020

Problems with Shapley-value-based explanations as feature importance measures

Game-theoretic formulations of feature importance have become popular as...
research
06/19/2023

Explaining the Model and Feature Dependencies by Decomposition of the Shapley Value

Shapley values have become one of the go-to methods to explain complex m...
research
09/16/2023

Fast Approximation of the Shapley Values Based on Order-of-Addition Experimental Designs

Shapley value is originally a concept in econometrics to fairly distribu...
research
01/22/2021

Estimating α-Rank by Maximizing Information Gain

Game theory has been increasingly applied in settings where the game is ...
research
12/03/2020

Competition analysis on the over-the-counter credit default swap market

We study two questions related to competition on the OTC CDS market usin...
research
03/10/2023

Feature Importance: A Closer Look at Shapley Values and LOCO

There is much interest lately in explainability in statistics and machin...

Please sign up or login with your details

Forgot password? Click here to reset