Review and Evaluation of Feature Selection Algorithms in Synthetic Problems

01/12/2011
by   L. A. Belanche, et al.
0

The main purpose of Feature Subset Selection is to find a reduced subset of attributes from a data set described by a feature set. The task of a feature selection algorithm (FSA) is to provide with a computational solution motivated by a certain definition of relevance or by a reliable evaluation measure. In this paper several fundamental algorithms are studied to assess their performance in a controlled experimental scenario. A measure to evaluate FSAs is devised that computes the degree of matching between the output given by a FSA and the known optimal solutions. An extensive experimental study on synthetic problems is carried out to assess the behaviour of the algorithms in terms of solution accuracy and size as a function of the relevance, irrelevance, redundancy and size of the data samples. The controlled experimental conditions facilitate the derivation of better-supported and meaningful conclusions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/25/2011

The All Relevant Feature Selection using Random Forest

In this paper we examine the application of the random forest classifier...
research
08/27/2020

Feature Selection from High-Dimensional Data with Very Low Sample Size: A Cautionary Tale

In classification problems, the purpose of feature selection is to ident...
research
09/24/2015

A Review of Feature Selection Methods Based on Mutual Information

In this work we present a review of the state of the art of information ...
research
10/17/2020

MithraDetective: A System for Cherry-picked Trendlines Detection

Given a data set, misleading conclusions can be drawn from it by cherry-...
research
09/12/2015

Double Relief with progressive weighting function

Feature weighting algorithms try to solve a problem of great importance ...
research
08/06/2014

New crossover operators for multiple subset selection tasks

We have introduced two crossover operators, MMX-BLXexploit and MMX-BLXex...
research
02/07/2013

Feature Selection for Microarray Gene Expression Data using Simulated Annealing guided by the Multivariate Joint Entropy

In this work a new way to calculate the multivariate joint entropy is pr...

Please sign up or login with your details

Forgot password? Click here to reset