Graph-Based Automatic Feature Selection for Multi-Class Classification via Mean Simplified Silhouette

09/05/2023
by   David Levin, et al.
0

This paper introduces a novel graph-based filter method for automatic feature selection (abbreviated as GB-AFS) for multi-class classification tasks. The method determines the minimum combination of features required to sustain prediction performance while maintaining complementary discriminating abilities between different classes. It does not require any user-defined parameters such as the number of features to select. The methodology employs the Jeffries-Matusita (JM) distance in conjunction with t-distributed Stochastic Neighbor Embedding (t-SNE) to generate a low-dimensional space reflecting how effectively each feature can differentiate between each pair of classes. The minimum number of features is selected using our newly developed Mean Simplified Silhouette (abbreviated as MSS) index, designed to evaluate the clustering results for the feature selection task. Experimental results on public data sets demonstrate the superior performance of the proposed GB-AFS over other filter-based techniques and automatic feature selection approaches. Moreover, the proposed algorithm maintained the accuracy achieved when utilizing all features, while using only 7% to 30% of the features. Consequently, this resulted in a reduction of the time needed for classifications, from 15% to 70%.

READ FULL TEXT
research
03/03/2023

Graph-based Extreme Feature Selection for Multi-class Classification Tasks

When processing high-dimensional datasets, a common pre-processing step ...
research
05/15/2011

Feature Selection for MAUC-Oriented Classification Systems

Feature selection is an important pre-processing step for many pattern c...
research
06/04/2015

Classification with many classes: challenges and pluses

The objective of the paper is to study accuracy of multi-class classific...
research
02/19/2023

Topological Feature Selection: A Graph-Based Filter Feature Selection Approach

In this paper, we introduce a novel unsupervised, graph-based filter fea...
research
05/11/2021

Two novel feature selection algorithms based on crowding distance

In this paper, two novel algorithms for features selection are proposed....
research
05/31/2023

Distance Rank Score: Unsupervised filter method for feature selection on imbalanced dataset

This paper presents a new filter method for unsupervised feature selecti...
research
03/08/2022

Model-free feature selection to facilitate automatic discovery of divergent subgroups in tabular data

Data-centric AI encourages the need of cleaning and understanding of dat...

Please sign up or login with your details

Forgot password? Click here to reset