Feature Importance: A Closer Look at Shapley Values and LOCO

03/10/2023
by Isabella Verdinelli, et al.

Explainability has recently attracted much interest in statistics and machine learning. One aspect of explainability is quantifying the importance of individual features (or covariates). Two popular methods for defining variable importance are LOCO (Leave Out COvariates) and Shapley values. We examine the properties of these methods and their advantages and disadvantages. We are particularly interested in the effect of correlation between features, which can obscure interpretability. Contrary to some claims, Shapley values do not eliminate the effect of feature correlation. We critique the game-theoretic axioms for Shapley values and propose new, more statistically oriented axioms for feature importance, along with some measures that satisfy them. However, correcting for correlation is a Faustian bargain: removing the effect of correlation creates other forms of bias. Ultimately, we recommend a slightly modified version of LOCO. We also briefly consider how to modify Shapley values to better address feature correlation.
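To make the correlation issue concrete, here is a minimal sketch of plain LOCO, not the modified version the authors recommend. It assumes the common formulation in which a feature's importance is the increase in held-out squared error after refitting the model without that feature. The linear model, the simulated data, and the names `fit_predict` and `loco_importance` are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def fit_predict(X_train, y_train, X_test):
    """Fit ordinary least squares with an intercept and predict on X_test."""
    A_train = np.column_stack([np.ones(len(X_train)), X_train])
    A_test = np.column_stack([np.ones(len(X_test)), X_test])
    coef, *_ = np.linalg.lstsq(A_train, y_train, rcond=None)
    return A_test @ coef

def loco_importance(X_train, y_train, X_test, y_test):
    """Hypothetical helper: increase in test MSE when each feature is left out."""
    full_mse = np.mean((y_test - fit_predict(X_train, y_train, X_test)) ** 2)
    scores = []
    for j in range(X_train.shape[1]):
        keep = [k for k in range(X_train.shape[1]) if k != j]
        pred = fit_predict(X_train[:, keep], y_train, X_test[:, keep])
        scores.append(np.mean((y_test - pred) ** 2) - full_mse)
    return np.array(scores)

# Simulated data (an assumption for illustration): x1 and x3 have the same
# true coefficient, but x1 has a highly correlated proxy x2, while x3 does not.
rng = np.random.default_rng(0)
n = 4000
x1 = rng.normal(size=n)
x2 = 0.95 * x1 + np.sqrt(1 - 0.95 ** 2) * rng.normal(size=n)
x3 = rng.normal(size=n)
X = np.column_stack([x1, x2, x3])
y = x1 + x3 + rng.normal(scale=0.5, size=n)

half = n // 2
scores = loco_importance(X[:half], y[:half], X[half:], y[half:])
print(scores)
```

In this setup the LOCO score for x1 should come out far smaller than the score for x3, even though both enter the outcome with the same coefficient, because x2 can stand in for x1 once x1 is dropped. This is the kind of correlation effect the abstract refers to.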

