Noise-Augmented Boruta: The Neural Network Perturbation Infusion with Boruta Feature Selection

09/18/2023
by   Hassan Gharoun, et al.
0

With the surge in data generation, both vertically (i.e., volume of data) and horizontally (i.e., dimensionality), the burden of the curse of dimensionality has become increasingly palpable. Feature selection, a key facet of dimensionality reduction techniques, has advanced considerably to address this challenge. One such advancement is the Boruta feature selection algorithm, which successfully discerns meaningful features by contrasting them to their permutated counterparts known as shadow features. However, the significance of a feature is shaped more by the data's overall traits than by its intrinsic value, a sentiment echoed in the conventional Boruta algorithm where shadow features closely mimic the characteristics of the original ones. Building on this premise, this paper introduces an innovative approach to the Boruta feature selection algorithm by incorporating noise into the shadow variables. Drawing parallels from the perturbation analysis framework of artificial neural networks, this evolved version of the Boruta method is presented. Rigorous testing on four publicly available benchmark datasets revealed that this proposed technique outperforms the classic Boruta algorithm, underscoring its potential for enhanced, accurate feature selection.

READ FULL TEXT

page 1

page 7

research
01/22/2021

Does a Hybrid Neural Network based Feature Selection Model Improve Text Classification?

Text classification is a fundamental problem in the field of natural lan...
research
11/30/2022

Universal Feature Selection Tool (UniFeat): An Open-Source Tool for Dimensionality Reduction

The Universal Feature Selection Tool (UniFeat) is an open-source tool de...
research
04/05/2023

Selecting Features by their Resilience to the Curse of Dimensionality

Real-world datasets are often of high dimension and effected by the curs...
research
10/19/2020

A Uniformly Stable Algorithm For Unsupervised Feature Selection

High-dimensional data presents challenges for data management. Feature s...
research
01/31/2022

Compactness Score: A Fast Filter Method for Unsupervised Feature Selection

For feature engineering, feature selection seems to be an important rese...
research
07/24/2023

Stochastic Step-wise Feature Selection for Exponential Random Graph Models (ERGMs)

Statistical analysis of social networks provides valuable insights into ...
research
08/19/2016

Unsupervised Feature Selection Based on the Morisita Estimator of Intrinsic Dimension

This paper deals with a new filter algorithm for selecting the smallest ...

Please sign up or login with your details

Forgot password? Click here to reset