A Novel Community Detection Based Genetic Algorithm for Feature Selection

08/08/2020
by   Mehrdad Rostami, et al.
0

The selection of features is an essential data preprocessing stage in data mining. The core principle of feature selection seems to be to pick a subset of possible features by excluding features with almost no predictive information as well as highly associated redundant features. In the past several years, a variety of meta-heuristic methods were introduced to eliminate redundant and irrelevant features as much as possible from high-dimensional datasets. Among the main disadvantages of present meta-heuristic based approaches is that they are often neglecting the correlation between a set of selected features. In this article, for the purpose of feature selection, the authors propose a genetic algorithm based on community detection, which functions in three steps. The feature similarities are calculated in the first step. The features are classified by community detection algorithms into clusters throughout the second step. In the third step, features are picked by a genetic algorithm with a new community-based repair operation. Nine benchmark classification problems were analyzed in terms of the performance of the presented approach. Also, the authors have compared the efficiency of the proposed approach with the findings from four available algorithms for feature selection. The findings indicate that the new approach continuously yields improved classification accuracy.

READ FULL TEXT
research
01/30/2020

A Hybrid Two-layer Feature Selection Method Using GeneticAlgorithm and Elastic Net

Feature selection, as a critical pre-processing step for machine learnin...
research
09/20/2016

GAdaBoost: Accelerating Adaboost Feature Selection with Genetic Algorithms

Boosted cascade of simple features, by Viola and Jones, is one of the mo...
research
05/22/2019

Selection of a Minimal Number of Significant Porcine SNPs by an Information Gain and Genetic Algorithm Hybrid Model

A panel of large number of common Single Nucleotide Polymorphisms (SNPs)...
research
10/04/2022

Robust self-healing prediction model for high dimensional data

Owing to the advantages of increased accuracy and the potential to detec...
research
03/07/2014

Ant Colony based Feature Selection Heuristics for Retinal Vessel Segmentation

Features selection is an essential step for successful data classificati...
research
02/02/2018

Generating Redundant Features with Unsupervised Multi-Tree Genetic Programming

Recently, feature selection has become an increasingly important area of...

Please sign up or login with your details

Forgot password? Click here to reset