Ultra-marginal Feature Importance

04/21/2022
by Joseph Janssen, et al.

Scientists frequently prioritize learning from data rather than training the best possible model; however, research in machine learning often prioritizes the latter. Marginal feature importance methods, such as marginal contribution feature importance (MCI), attempt to break this trend by providing a useful framework for quantifying the relationships in data in an interpretable fashion. In this work, we introduce ultra-marginal feature importance (UMFI), which generalizes the MCI framework while aiming to improve both performance and runtime. We prove that UMFI can be computed directly by applying preprocessing methods from the AI fairness literature to remove dependencies in the feature set. We show on real and simulated data that UMFI performs at least as well as MCI, with significantly better performance in the presence of correlated interactions and unrelated features, while substantially reducing runtime from the exponential cost of MCI to super-linear.
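The abstract does not give the algorithm itself, so the following is only a minimal sketch of one possible reading of the idea: for each feature, strip the other features of their dependence on it, then score the feature by how much it improves a model fitted on the cleaned set. Everything concrete here is an assumption made for illustration and is not taken from the paper: the function names (`remove_linear_dependence`, `r2_ols`, `umfi_sketch`), the use of linear residualization as a stand-in for the fairness-style preprocessing, and the use of in-sample R^2 from ordinary least squares as the evaluation function.

```python
import numpy as np


def r2_ols(X, y):
    """In-sample R^2 of a least-squares fit with intercept (assumed evaluation function)."""
    A = np.column_stack([np.ones(len(y)), X])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    resid = y - A @ coef
    return 1.0 - resid.var() / y.var()


def remove_linear_dependence(X_others, x_target):
    """Residualize every column of X_others on x_target.

    Assumption: a purely linear stand-in for the dependence-removal
    preprocessing mentioned in the abstract; it only removes linear dependence.
    """
    A = np.column_stack([np.ones(len(x_target)), x_target])
    coef, *_ = np.linalg.lstsq(A, X_others, rcond=None)
    return X_others - A @ coef


def umfi_sketch(X, y):
    """Hypothetical per-feature score: gain in R^2 from adding feature j
    to the other features after their dependence on j has been removed.
    Assumes X has at least two columns."""
    n_features = X.shape[1]
    scores = np.zeros(n_features)
    for j in range(n_features):
        others = np.delete(X, j, axis=1)
        cleaned = remove_linear_dependence(others, X[:, j])
        with_j = r2_ols(np.column_stack([cleaned, X[:, j]]), y)
        without_j = r2_ols(cleaned, y)
        scores[j] = max(with_j - without_j, 0.0)
    return scores
```

Under these assumptions the loop makes one pass per feature rather than enumerating feature subsets, which is the kind of saving the abstract alludes to. In this linear sketch, two strongly correlated informative features both receive a high score, while a feature unrelated to the response scores close to zero.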


