EFSIS: Ensemble Feature Selection Integrating Stability

11/19/2018
by Xiaokang Zhang, et al.

Ensemble learning, which combines the predictions of multiple learners, has been widely applied in pattern recognition and has been reported to be more robust and accurate than individual learners. This ensemble logic has recently also been applied to feature selection. There are two main strategies for ensemble feature selection: data perturbation and function perturbation. Data perturbation performs feature selection on data subsets sampled from the original dataset and then selects the features that are consistently ranked highly across those subsets. This has been found to improve both the stability of the selector and the prediction accuracy of a classifier. Function perturbation aggregates multiple selectors, which frees the user from having to decide on the most appropriate selector for a given situation. This has been found to maintain or improve classification performance. Here we propose a framework, EFSIS, that combines these two strategies. Empirical results indicate that EFSIS gives both high prediction accuracy and high stability.
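As a rough illustration of how the two strategies can be combined, the sketch below applies several selectors (function perturbation) to bootstrap resamples of a dataset (data perturbation) and averages the resulting feature ranks. It is not the EFSIS algorithm, whose details the abstract does not give. The two scoring functions, the mean-rank aggregation, the toy data, and every name in the code are assumptions chosen for illustration only.

```python
import random
from statistics import mean


def mean_diff_score(X, y, j):
    """Hypothetical selector 1: absolute difference of class means for feature j."""
    pos = [row[j] for row, label in zip(X, y) if label == 1]
    neg = [row[j] for row, label in zip(X, y) if label == 0]
    if not pos or not neg:  # a resample may lack one class
        return 0.0
    return abs(mean(pos) - mean(neg))


def abs_corr_score(X, y, j):
    """Hypothetical selector 2: absolute Pearson correlation of feature j with the label."""
    col = [row[j] for row in X]
    mx, my = mean(col), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(col, y))
    vx = sum((a - mx) ** 2 for a in col)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0
    return abs(cov) / (vx * vy) ** 0.5


def ranks(scores):
    """Turn scores into ranks, where rank 0 is the highest-scoring feature."""
    order = sorted(range(len(scores)), key=lambda j: -scores[j])
    result = [0] * len(scores)
    for position, j in enumerate(order):
        result[j] = position
    return result


def ensemble_select(X, y, selectors, n_bootstraps=20, k=2, seed=0):
    """Average feature ranks over bootstrap resamples and over selectors."""
    rng = random.Random(seed)
    n, p = len(X), len(X[0])
    total = [0.0] * p
    runs = 0
    for _ in range(n_bootstraps):
        # Data perturbation: resample the rows with replacement.
        idx = [rng.randrange(n) for _ in range(n)]
        Xb = [X[i] for i in idx]
        yb = [y[i] for i in idx]
        # Function perturbation: rank features with every selector.
        for selector in selectors:
            r = ranks([selector(Xb, yb, j) for j in range(p)])
            for j in range(p):
                total[j] += r[j]
            runs += 1
    mean_rank = [t / runs for t in total]
    selected = sorted(range(p), key=lambda j: mean_rank[j])[:k]
    return selected, mean_rank


# Toy data (assumed): features 0 and 1 depend on the label, 2 and 3 are noise.
data_rng = random.Random(42)
y = [i % 2 for i in range(40)]
X = [
    [10 * label + data_rng.gauss(0, 1),
     -10 * label + data_rng.gauss(0, 1),
     data_rng.gauss(0, 1),
     data_rng.gauss(0, 1)]
    for label in y
]

selected, mean_rank = ensemble_select(X, y, [mean_diff_score, abs_corr_score])
print("selected features:", sorted(selected))
print("mean ranks:", mean_rank)
```

Averaging ranks rather than raw scores keeps selectors with different score scales comparable. Other aggregation rules, such as counting how often a feature lands in the top k, would fit the same loop.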

research
10/26/2020

Fast-Ensembles of Minimum Redundancy Feature Selection

Finding relevant subspaces in very high-dimensional data is a challengin...
research
08/03/2021

Fast Estimation Method for the Stability of Ensemble Feature Selectors

It is preferred that feature selectors be stable for better interpretabi...
research
07/31/2021

A Hybrid Ensemble Feature Selection Design for Candidate Biomarkers Discovery from Transcriptome Profiles

The discovery of disease biomarkers from gene expression data has been g...
research
11/30/2020

Utilizing stability criteria in choosing feature selection methods yields reproducible results in microbiome data

Feature selection is indispensable in microbiome data analysis, but it c...
research
01/21/2020

Wrapper Feature Selection Algorithm for the Optimization of an Indicator System of Patent Value Assessment

Effective patent value assessment provides decision support for patent t...
research
05/14/2023

Unraveling Cold Start Enigmas in Predictive Analytics for OTT Media: Synergistic Meta-Insights and Multimodal Ensemble Mastery

The cold start problem is a common challenge in various domains, includi...
research
05/20/1999

Robust Combining of Disparate Classifiers through Order Statistics

Integrating the outputs of multiple classifiers via combiners or meta-le...
