Graph-based Extreme Feature Selection for Multi-class Classification Tasks

03/03/2023
by Shir Friedman, et al.

When processing high-dimensional datasets, a common pre-processing step is feature selection. Filter-based feature selection algorithms are not tailored to a specific classification method, but rather rank the relevance of each feature with respect to the target and the task. This work focuses on a graph-based filter feature selection method that is suited for multi-class classification tasks. We aim to drastically reduce the number of selected features in order to create a sketch of the original data that encodes valuable information for the classification task. The proposed graph-based algorithm is constructed by combining the Jeffries-Matusita distance with a non-linear dimension reduction method, diffusion maps. Feature elimination is performed based on the distribution of the features in the low-dimensional space. Then, a very small number of features that have complementary separation strengths are selected. Moreover, the low-dimensional embedding makes it possible to visualize the feature space. Experimental results are provided for public datasets and compared with known filter-based feature selection techniques.
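To make the two building blocks named in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It assumes each feature is Gaussian within each class, uses the common scaling JM = 2(1 - exp(-B)) with B the Bhattacharyya distance, and describes every feature by its vector of JM distances over all class pairs before embedding the features with a basic diffusion map. The function names `jm_matrix` and `diffusion_map`, the median bandwidth heuristic, and the synthetic data are all illustrative assumptions. The elimination and selection rules are not specified in the abstract, so the sketch stops at the embedding.

```python
import itertools
import numpy as np


def jm_matrix(X, y, eps=1e-12):
    """Per-feature JM distance for every class pair (Gaussian assumption).

    Returns an array of shape (n_features, n_class_pairs), values in [0, 2).
    """
    classes = np.unique(y)
    means = np.array([X[y == c].mean(axis=0) for c in classes])
    varis = np.array([X[y == c].var(axis=0) for c in classes]) + eps
    cols = []
    for a, b in itertools.combinations(range(len(classes)), 2):
        v_sum = varis[a] + varis[b]
        # Univariate Bhattacharyya distance between two Gaussians
        bhatt = (0.25 * (means[a] - means[b]) ** 2 / v_sum
                 + 0.5 * np.log(v_sum / (2.0 * np.sqrt(varis[a] * varis[b]))))
        cols.append(2.0 * (1.0 - np.exp(-bhatt)))
    return np.stack(cols, axis=1)


def diffusion_map(points, n_components=2, epsilon=None):
    """Basic diffusion-map embedding of the rows of `points`."""
    sq = ((points[:, None, :] - points[None, :, :]) ** 2).sum(axis=-1)
    if epsilon is None:
        # Assumed heuristic: median of the non-zero squared distances
        epsilon = np.median(sq[sq > 0])
    K = np.exp(-sq / epsilon)
    d = K.sum(axis=1)
    # Symmetric matrix that shares eigenvalues with the Markov matrix D^-1 K
    A = K / np.sqrt(np.outer(d, d))
    vals, vecs = np.linalg.eigh(A)
    order = np.argsort(vals)[::-1]
    vals, vecs = vals[order], vecs[:, order]
    psi = vecs / np.sqrt(d)[:, None]  # right eigenvectors of D^-1 K
    # Skip the trivial first eigenpair; scale coordinates by eigenvalue
    return psi[:, 1:n_components + 1] * vals[1:n_components + 1]


# Synthetic example: 3 classes, 6 features, two of them informative
rng = np.random.default_rng(0)
y = np.repeat([0, 1, 2], 50)
X = rng.normal(size=(150, 6))
X[:, 0] += 4.0 * (y == 0)  # feature 0 separates class 0 from the rest
X[:, 1] += 4.0 * (y == 2)  # feature 1 separates class 2 from the rest

jm = jm_matrix(X, y)            # one row per feature, one column per class pair
embedding = diffusion_map(jm)   # 2-D coordinates, one row per feature
```

In this toy setup, features 0 and 1 separate different class pairs, which is the kind of complementary separation strength the abstract refers to; plotting `embedding` gives a simple view of the feature space.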

research
09/05/2023

Graph-Based Automatic Feature Selection for Multi-Class Classification via Mean Simplified Silhouette

This paper introduces a novel graph-based filter method for automatic fe...
research
01/05/2014

Feature Selection Using Classifier in High Dimensional Data

Feature selection is frequently used as a pre-processing step to machine...
research
05/15/2011

Feature Selection for MAUC-Oriented Classification Systems

Feature selection is an important pre-processing step for many pattern c...
research
02/19/2023

Topological Feature Selection: A Graph-Based Filter Feature Selection Approach

In this paper, we introduce a novel unsupervised, graph-based filter fea...
research
05/11/2021

Two novel feature selection algorithms based on crowding distance

In this paper, two novel algorithms for features selection are proposed....
research
04/09/2013

Image Classification by Feature Dimension Reduction and Graph based Ranking

Dimensionality reduction (DR) of image features plays an important role ...
research
05/31/2023

Distance Rank Score: Unsupervised filter method for feature selection on imbalanced dataset

This paper presents a new filter method for unsupervised feature selecti...
