MVMR-FS : Non-parametric feature selection algorithm based on Maximum inter-class Variation and Minimum Redundancy

07/27/2023
by   Haitao Nie, et al.
0

How to accurately measure the relevance and redundancy of features is an age-old challenge in the field of feature selection. However, existing filter-based feature selection methods cannot directly measure redundancy for continuous data. In addition, most methods rely on manually specifying the number of features, which may introduce errors in the absence of expert knowledge. In this paper, we propose a non-parametric feature selection algorithm based on maximum inter-class variation and minimum redundancy, abbreviated as MVMR-FS. We first introduce supervised and unsupervised kernel density estimation on the features to capture their similarities and differences in inter-class and overall distributions. Subsequently, we present the criteria for maximum inter-class variation and minimum redundancy (MVMR), wherein the inter-class probability distributions are employed to reflect feature relevance and the distances between overall probability distributions are used to quantify redundancy. Finally, we employ an AGA to search for the feature subset that minimizes the MVMR. Compared with ten state-of-the-art methods, MVMR-FS achieves the highest average accuracy and improves the accuracy by 5

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/24/2014

Mutual Information-Based Unsupervised Feature Transformation for Heterogeneous Feature Subset Selection

Conventional mutual information (MI) based feature selection (FS) method...
research
02/01/2015

Feature Selection with Redundancy-complementariness Dispersion

Feature selection has attracted significant attention in data mining and...
research
09/07/2017

Feature selection in high-dimensional dataset using MapReduce

This paper describes a distributed MapReduce implementation of the minim...
research
08/15/2019

Maximum Relevance and Minimum Redundancy Feature Selection Methods for a Marketing Machine Learning Platform

In machine learning applications for online product offerings and market...
research
01/08/2023

Analogical Relevance Index

Focusing on the most significant features of a dataset is useful both in...
research
06/14/2016

Max-Margin Feature Selection

Many machine learning applications such as in vision, biology and social...
research
10/09/2018

Deep supervised feature selection using Stochastic Gates

In this study, we propose a novel non-parametric embedded feature select...

Please sign up or login with your details

Forgot password? Click here to reset