Model-agnostic Feature Importance and Effects with Dependent Features – A Conditional Subgroup Approach

06/08/2020
by   Christoph Molnar, et al.
0

Partial dependence plots and permutation feature importance are popular model-agnostic interpretation methods. Both methods are based on predicting artificially created data points. When features are dependent, both methods extrapolate to feature areas with low data density. The extrapolation can cause misleading interpretations. To overcome extrapolation, we propose conditional variants of partial dependence plots and permutation feature importance. Our approach is based on perturbations in subgroups. The subgroups partition the feature space to make the feature distribution within a group more homogeneous and between the groups more heterogeneous. The interpretable subgroups enable additional local, nuanced interpretations of the feature dependence structure as well as the feature effects and importance values within the subgroups. We also introduce a data fidelity measure that captures the degree of extrapolation when data is transformed with a certain perturbation. In simulations and benchmarks on real data we show that our conditional interpretation methods reduce extrapolation. In an application we show that these methods provide more nuanced and richer explanations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/23/2021

Grouped Feature Importance and Combined Features Effect Plot

Interpretable machine learning has become a very active area of research...
research
09/21/2022

Algorithm-Agnostic Interpretations for Clustering

A clustering outcome for high-dimensional data is typically interpreted ...
research
05/01/2019

Please Stop Permuting Features: An Explanation and Alternatives

This paper advocates against permute-and-predict (PaP) methods for inter...
research
04/09/2021

Transforming Feature Space to Interpret Machine Learning Models

Model-agnostic tools for interpreting machine-learning models struggle t...
research
09/03/2021

Relating the Partial Dependence Plot and Permutation Feature Importance to the Data Generating Process

Scientists and practitioners increasingly rely on machine learning to mo...
research
06/01/2023

Decomposing Global Feature Effects Based on Feature Interactions

Global feature effect methods, such as partial dependence plots, provide...
research
09/06/2021

Bringing a Ruler Into the Black Box: Uncovering Feature Impact from Individual Conditional Expectation Plots

As machine learning systems become more ubiquitous, methods for understa...

Please sign up or login with your details

Forgot password? Click here to reset