Improving Mutual Information based Feature Selection by Boosting Unique Relevance

12/09/2022
by Shiyu Liu, et al.

Mutual Information (MI) based feature selection (MIBFS) uses MI to evaluate each feature and shortlist a relevant feature subset, in order to address issues associated with high-dimensional datasets. Despite the effectiveness of MI in feature selection, we notice that many state-of-the-art algorithms disregard the so-called unique relevance (UR) of features and arrive at a suboptimal feature subset that contains a non-negligible number of redundant features. We point out that the heart of the problem is that all these MIBFS algorithms follow the criterion of Maximize Relevance with Minimum Redundancy (MRwMR), which does not explicitly target UR. This motivates us to augment the existing criterion with the objective of boosting unique relevance (BUR), leading to a new criterion called MRwMR-BUR. Depending on the task being addressed, MRwMR-BUR has two variants, termed MRwMR-BUR-KSG and MRwMR-BUR-CLF, which estimate UR differently. MRwMR-BUR-KSG estimates UR via a nearest-neighbor-based approach called the KSG estimator and is designed for three major tasks: (i) classification performance, (ii) feature interpretability, and (iii) classifier generalization. MRwMR-BUR-CLF estimates UR via a classifier-based approach. It adapts UR to different classifiers, further improving the competitiveness of MRwMR-BUR for tasks oriented toward classification performance. The performance of both MRwMR-BUR-KSG and MRwMR-BUR-CLF is validated via experiments using six public datasets and three popular classifiers. Specifically, as compared to MRwMR, the proposed MRwMR-BUR-KSG improves the test accuracy by 2 selected, without increasing the algorithm complexity. MRwMR-BUR-CLF further improves the classification performance by 3.8 it also outperforms three popular classifier-dependent feature selection methods.
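
To make the idea of augmenting MRwMR with a unique-relevance term concrete, the following is a minimal sketch and not the authors' implementation. It rests on several assumptions that the abstract does not state: UR of a feature is taken to be the conditional MI between that feature and the label given all other features; the base MRwMR score is an mRMR-style "relevance minus average redundancy"; the two are mixed with a hypothetical weight `beta`; and MI is computed with a simple plug-in estimate on discrete toy data rather than with the KSG estimator or a classifier. All function names are illustrative.

```python
from collections import Counter
from math import log2


def entropy(columns):
    """Plug-in joint entropy (in bits) of one or more discrete columns."""
    rows = list(zip(*columns))
    n = len(rows)
    return -sum(c / n * log2(c / n) for c in Counter(rows).values())


def mutual_info(a, b):
    """I(A; B) = H(A) + H(B) - H(A, B)."""
    return entropy([a]) + entropy([b]) - entropy([a, b])


def unique_relevance(features, y, k):
    """ASSUMED definition of UR: I(X_k; Y | all other features)."""
    others = [f for i, f in enumerate(features) if i != k]
    h_others = entropy(others) if others else 0.0
    # I(X; Y | Z) = H(X, Z) + H(Y, Z) - H(X, Y, Z) - H(Z)
    return (entropy([features[k]] + others) + entropy([y] + others)
            - entropy([features[k], y] + others) - h_others)


def select(features, y, n_select, beta=0.1):
    """Greedy selection; beta (hypothetical weight) mixes in the UR term."""
    ur = [unique_relevance(features, y, k) for k in range(len(features))]
    selected = []
    while len(selected) < n_select:
        best, best_score = None, float("-inf")
        for k, f in enumerate(features):
            if k in selected:
                continue
            relevance = mutual_info(f, y)
            # Average MI with already-selected features (mRMR-style redundancy).
            redundancy = (
                sum(mutual_info(f, features[j]) for j in selected) / len(selected)
                if selected else 0.0
            )
            score = (1 - beta) * (relevance - redundancy) + beta * ur[k]
            if score > best_score:
                best, best_score = k, score
        selected.append(best)
    return selected


# Toy data: x1 duplicates x0, x2 is independent, and y encodes (x0, x2).
toy_features = [[0, 0, 1, 1], [0, 0, 1, 1], [0, 1, 0, 1]]
toy_label = [0, 1, 2, 3]
print(select(toy_features, toy_label, 1, beta=0.0))  # [0]
print(select(toy_features, toy_label, 1, beta=0.5))  # [2]
```

In this toy case all three features have the same relevance to the label, so the unweighted criterion simply picks the first one. Once the UR term is switched on, the only feature whose information about the label cannot be recovered from the others (x2) is ranked first. For continuous features, the plug-in estimate here would need to be swapped for an estimator such as KSG.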


