Causality-Based Feature Importance Quantifying Methods: PN-FI, PS-FI and PNS-FI

08/28/2023
by Shuxian Du, et al.

As machine learning models grow larger and more complex, and datasets grow in both size and dimensionality, a good feature selection (FS) method in the preprocessing stage becomes necessary to train better models while saving training time and computational resources. Feature importance (FI) matters because it is the basis of feature selection. This paper brings the calculation of PNS (the probability of necessity and sufficiency) from causality into the quantification of feature importance and proposes three new FI measures: PN-FI, which measures how important a feature is in image recognition tasks; PS-FI, which measures how important a feature is in image generation tasks; and PNS-FI, which measures both. The main body of the paper consists of three randomized controlled trials (RCTs), whose results are used to show how PS-FI, PN-FI and PNS-FI are calculated for three features: dog nose, dog eyes and dog mouth. The FI values are intervals with tight upper and lower bounds.
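To make the idea of an interval-valued score concrete, here is a minimal sketch of how PNS can be bounded from experimental (RCT) data alone, using the standard Tian-Pearl bounds. This is an illustration under assumptions, not the paper's procedure: the abstract does not state which bounds the authors use, and the function name `pns_bounds`, its parameters, and the example probabilities are all hypothetical.

```python
def pns_bounds(p_y_do_x: float, p_y_do_not_x: float) -> tuple[float, float]:
    """Tian-Pearl bounds on PNS from experimental data only.

    p_y_do_x     : P(y | do(x)),  outcome rate when the feature is present
    p_y_do_not_x : P(y | do(x')), outcome rate when the feature is absent

    Both names are hypothetical; they are not taken from the paper.
    """
    for p in (p_y_do_x, p_y_do_not_x):
        if not 0.0 <= p <= 1.0:
            raise ValueError("probabilities must lie in [0, 1]")
    # Lower bound: max(0, P(y_x) - P(y_x'))
    lower = max(0.0, p_y_do_x - p_y_do_not_x)
    # Upper bound: min(P(y_x), P(y'_x')), where P(y'_x') = 1 - P(y_x')
    upper = min(p_y_do_x, 1.0 - p_y_do_not_x)
    return lower, upper


if __name__ == "__main__":
    # Made-up rates for illustration only, not results from the paper.
    low, high = pns_bounds(p_y_do_x=0.9, p_y_do_not_x=0.2)
    print(f"PNS lies in [{low:.2f}, {high:.2f}]")
```

With only experimental rates available, the score is an interval rather than a point value, which matches the abstract's statement that the FI values are intervals. Bounds on PN and PS separately generally need observational data as well, so they are left out of this sketch.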

research
11/17/2019

Causality-based Feature Selection: Methods and Evaluations

Feature selection is a crucial preprocessing step in data analytics and ...
research
06/26/2019

A Debiased MDI Feature Importance Measure for Random Forests

Tree ensembles such as Random Forests have achieved impressive empirical...
research
09/05/2021

Scalable Feature Selection for (Multitask) Gradient Boosted Trees

Gradient Boosted Decision Trees (GBDTs) are widely used for building ran...
research
06/08/2020

Nonparametric Feature Impact and Importance

Practitioners use feature importance to rank and eliminate weak predicto...
research
10/02/2022

Ensembling improves stability and power of feature selection for deep learning models

With the growing adoption of deep learning models in different real-worl...
research
11/16/2021

Outlier Detection as Instance Selection Method for Feature Selection in Time Series Classification

In order to allow machine learning algorithms to extract knowledge from ...
research
04/21/2022

Ultra-marginal Feature Importance

Scientists frequently prioritize learning from data rather than training...
