Causality-based Feature Selection: Methods and Evaluations

11/17/2019
by Kui Yu, et al.

Feature selection is a crucial preprocessing step in data analytics and machine learning. Classical feature selection algorithms select features based on their correlations with the class variable and do not attempt to capture the causal relationships between them. Knowledge of the causal relationships between features and the class variable has been shown to have potential benefits for building interpretable and robust prediction models, since causal relationships reflect the underlying mechanism of a system. Consequently, causality-based feature selection has gradually attracted greater attention, and many algorithms have been proposed. In this paper, we present a comprehensive review of recent advances in causality-based feature selection. To facilitate the development of new algorithms in this research area and to make it easy to compare new methods with existing ones, we develop the first open-source package, called CausalFS, which consists of most of the representative causality-based feature selection algorithms (available at https://github.com/kuiy/CausalFS). Using CausalFS, we conduct extensive experiments to compare the representative algorithms on both synthetic and real-world data sets. Finally, we discuss some challenging problems to be tackled in future causality-based feature selection research.
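
The contrast the abstract draws between correlation-based and causality-based selection can be illustrated with a small toy sketch. It does not use CausalFS and is not an algorithm from the paper: the synthetic variables `x1`, `x2`, `x3`, the helper `partial_corr`, and the linear-Gaussian setup are all assumptions made for illustration. A feature that is merely a downstream effect of a true cause looks useful under marginal correlation, but loses its association with the class variable once the cause is conditioned on.

```python
import numpy as np


def partial_corr(x, y, z):
    """Correlation between x and y after linearly regressing z out of both.

    Hypothetical helper for this sketch; valid as a conditional-independence
    check only under the linear-Gaussian assumption used below.
    """
    design = np.column_stack([np.ones_like(z), z])
    res_x = x - design @ np.linalg.lstsq(design, x, rcond=None)[0]
    res_y = y - design @ np.linalg.lstsq(design, y, rcond=None)[0]
    return float(np.corrcoef(res_x, res_y)[0, 1])


rng = np.random.default_rng(0)
n = 5000

x1 = rng.normal(size=n)              # direct cause of y
x2 = x1 + 0.5 * rng.normal(size=n)   # effect of x1, not a cause of y
x3 = rng.normal(size=n)              # unrelated to y
y = 2.0 * x1 + rng.normal(size=n)    # target (continuous here for simplicity)

# Correlation-based view: rank features by marginal correlation with y.
marginal = {
    name: float(np.corrcoef(feature, y)[0, 1])
    for name, feature in {"x1": x1, "x2": x2, "x3": x3}.items()
}

# Causality-flavoured view: does the association survive conditioning?
cond_x2_given_x1 = partial_corr(x2, y, x1)  # expected to be near zero
cond_x1_given_x2 = partial_corr(x1, y, x2)  # expected to stay clearly nonzero

print("marginal correlations:", marginal)
print("x2 vs y given x1:", cond_x2_given_x1)
print("x1 vs y given x2:", cond_x1_given_x2)
```

In this setup a correlation ranking would keep both `x1` and `x2`, whereas the conditional check suggests that only `x1` carries information about `y` that the other feature cannot explain away.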

research
02/16/2018

A Unified View of Causal and Non-causal Feature Selection

In this paper, we unify causal and non-causal feature selection ...
research
08/28/2023

Causality-Based Feature Importance Quantifying Methods: PN-FI, PS-FI and PNS-FI

In current ML field models are getting larger and more complex, data we ...
research
07/05/2020

Handling high correlations in the feature gene selection using Single-Cell RNA sequencing data

Motivation: Selecting feature genes and predicting cells' phenotype are ...
research
04/11/2023

Selecting Robust Features for Machine Learning Applications using Multidata Causal Discovery

Robust feature selection is vital for creating reliable and interpretabl...
research
09/22/2020

Using Unsupervised Learning to Help Discover the Causal Graph

The software outlined in this paper, AitiaExplorer, is an exploratory ca...
research
10/09/2020

Causal Feature Selection with Dimension Reduction for Interpretable Text Classification

Text features that are correlated with class labels, but do not directly...
research
12/21/2020

Personalized fall detection monitoring system based on learning from the user movements

Personalized fall detection system is shown to provide added and more be...
