Approximation of group explainers with coalition structure using Monte Carlo sampling on the product space of coalitions and features

In recent years, many Machine Learning (ML) explanation techniques have been designed using ideas from cooperative game theory. These game-theoretic explainers suffer from high complexity, hindering their exact computation in practical settings. In our work, we focus on a wide class of linear game values, as well as coalitional values, for the marginal game based on a given ML model and predictor vector. By viewing these explainers as expectations over appropriate sample spaces, we design a novel Monte Carlo sampling algorithm that estimates them at a reduced complexity that depends linearly on the size of the background dataset. We set up a rigorous framework for the statistical analysis and obtain error bounds for our sampling methods. The advantage of this approach is that it is fast, easily implementable, and model-agnostic. Furthermore, it has similar statistical accuracy as other known estimation techniques that are more complex and model-specific. We provide rigorous proofs of statistical convergence, as well as numerical experiments whose results agree with our theoretical findings.

READ FULL TEXT
research
04/25/2021

Sampling Permutations for Shapley Value Estimation

Game-theoretic attribution techniques based on Shapley values are used e...
research
02/22/2021

Mutual information-based group explainers with coalition structure for machine learning model explanations

In this article, we propose and investigate ML group explainers in a gen...
research
11/08/2015

Sandwiching the marginal likelihood using bidirectional Monte Carlo

Computing the marginal likelihood (ML) of a model requires marginalizing...
research
04/28/2022

A new certified hierarchical and adaptive RB-ML-ROM surrogate model for parametrized PDEs

We present a new surrogate modeling technique for efficient approximatio...
research
09/16/2023

Fast Approximation of the Shapley Values Based on Order-of-Addition Experimental Designs

Shapley value is originally a concept in econometrics to fairly distribu...
research
07/06/2019

Precision annealing Monte Carlo methods for statistical data assimilation and machine learning

In statistical data assimilation (SDA) and supervised machine learning (...
research
10/22/2020

A Multilinear Sampling Algorithm to Estimate Shapley Values

Shapley values are great analytical tools in game theory to measure the ...

Please sign up or login with your details

Forgot password? Click here to reset