An Evolutionary Multitasking Algorithm with Multiple Filtering for High-Dimensional Feature Selection

by Lingjie Li, et al.

Evolutionary multitasking (EMT) has recently been applied successfully to high-dimensional classification. However, existing EMT-based feature selection (FS) methods generate their tasks in a relatively simple way: they use only the Relief-F method to group related features of similar importance into one task, which limits the diversity of tasks available for knowledge transfer. This paper therefore devises a new EMT algorithm for FS in high-dimensional classification that first applies several different filtering methods to produce multiple tasks and then modifies a competitive swarm optimizer to solve these related tasks efficiently via knowledge transfer. First, a diversified multi-task generation method based on multiple filtering methods produces several relevant low-dimensional FS tasks by eliminating irrelevant features. Useful knowledge gained from solving these simpler, related tasks can then be transferred to simplify and speed up the solution of the original high-dimensional FS task. Second, the modified competitive swarm optimizer solves these relevant FS tasks simultaneously by transferring useful knowledge among them. Empirical results on eighteen high-dimensional datasets show that the proposed EMT-based FS method obtains better feature subsets than several state-of-the-art FS methods.
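
The idea of generating several low-dimensional FS tasks from different filters can be illustrated with a minimal sketch. The abstract does not name the filtering methods used, so the two scoring functions below (absolute Pearson correlation and a Fisher-style score), the helper `generate_tasks`, the `keep` parameter, and the toy data are all assumptions chosen for illustration. This is not the authors' implementation, and it leaves out the competitive swarm optimizer and the knowledge transfer step entirely.

```python
import math
import random


def pearson_score(column, labels):
    """Absolute Pearson correlation between one feature column and the labels."""
    n = len(column)
    mean_x = sum(column) / n
    mean_y = sum(labels) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(column, labels))
    var_x = sum((x - mean_x) ** 2 for x in column)
    var_y = sum((y - mean_y) ** 2 for y in labels)
    if var_x == 0 or var_y == 0:
        return 0.0
    return abs(cov) / math.sqrt(var_x * var_y)


def fisher_score(column, labels):
    """Between-class scatter divided by within-class scatter for one feature."""
    overall_mean = sum(column) / len(column)
    between = 0.0
    within = 0.0
    for cls in set(labels):
        values = [x for x, y in zip(column, labels) if y == cls]
        cls_mean = sum(values) / len(values)
        between += len(values) * (cls_mean - overall_mean) ** 2
        within += sum((x - cls_mean) ** 2 for x in values)
    return between / (within + 1e-12)  # small constant avoids division by zero


def generate_tasks(X, y, filters, keep):
    """Build one reduced FS task (a sorted list of feature indices) per filter.

    Hypothetical helper: each filter ranks all features and only the `keep`
    best-scoring ones are retained. The full feature set is added as the
    original high-dimensional task.
    """
    n_features = len(X[0])
    columns = [[row[j] for row in X] for j in range(n_features)]
    tasks = {}
    for name, score_fn in filters.items():
        scores = [score_fn(col, y) for col in columns]
        ranked = sorted(range(n_features), key=lambda j: scores[j], reverse=True)
        tasks[name] = sorted(ranked[:keep])
    tasks["original"] = list(range(n_features))
    return tasks


# Toy data (assumed): 60 samples, 3 classes, 10 features.
# Feature 0 grows linearly with the class label; feature 1 is shifted only
# for class 1 (a non-monotone pattern); the remaining features are noise.
rng = random.Random(0)
y = [i % 3 for i in range(60)]
X = []
for label in y:
    row = [rng.gauss(0.0, 0.1) for _ in range(10)]
    row[0] += label
    row[1] += 1.0 if label == 1 else 0.0
    X.append(row)

filters = {"pearson": pearson_score, "fisher": fisher_score}
tasks = generate_tasks(X, y, filters, keep=2)
for name, features in tasks.items():
    print(name, features)
```

In this toy setup the Fisher-style filter retains both informative features, while the correlation filter is largely blind to the non-monotone feature 1, so the two filters yield different reduced tasks. That difference is the kind of task diversity the abstract argues a single filter cannot provide.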




