An Evolutionary Multitasking Algorithm with Multiple Filtering for High-Dimensional Feature Selection

12/17/2022
by   Lingjie Li, et al.
0

Recently, evolutionary multitasking (EMT) has been successfully used in the field of high-dimensional classification. However, the generation of multiple tasks in the existing EMT-based feature selection (FS) methods is relatively simple, using only the Relief-F method to collect related features with similar importance into one task, which cannot provide more diversified tasks for knowledge transfer. Thus, this paper devises a new EMT algorithm for FS in high-dimensional classification, which first adopts different filtering methods to produce multiple tasks and then modifies a competitive swarm optimizer to efficiently solve these related tasks via knowledge transfer. First, a diversified multiple task generation method is designed based on multiple filtering methods, which generates several relevant low-dimensional FS tasks by eliminating irrelevant features. In this way, useful knowledge for solving simple and relevant tasks can be transferred to simplify and speed up the solution of the original high-dimensional FS task. Then, a competitive swarm optimizer is modified to simultaneously solve these relevant FS tasks by transferring useful knowledge among them. Numerous empirical results demonstrate that the proposed EMT-based FS method can obtain a better feature subset than several state-of-the-art FS methods on eighteen high-dimensional datasets.

READ FULL TEXT

page 1

page 15

research
03/17/2023

SFE: A Simple, Fast and Efficient Feature Selection Algorithm for High-Dimensional Data

In this paper, a new feature selection algorithm, called SFE (Simple, Fa...
research
02/27/2020

High-Dimensional Feature Selection for Genomic Datasets

In the presence of large dimensional datasets that contain many irreleva...
research
12/01/2020

Quick and Robust Feature Selection: the Strength of Energy-efficient Sparse Training for Autoencoders

Major complications arise from the recent increase in the amount of high...
research
06/11/2023

Efficient Learning of Minimax Risk Classifiers in High Dimensions

High-dimensional data is common in multiple areas, such as health care a...
research
02/18/2021

Sliced ℒ_2 Distance for Colour Grading

We propose a new method with ℒ_2 distance that maps one N-dimensional di...
research
01/24/2019

A Stable Combinatorial Particle Swarm Optimization for Scalable Feature Selection in Gene Expression Data

Evolutionary computation (EC) algorithms, such as discrete and multi-obj...
research
09/30/2017

Testing for Feature Relevance: The HARVEST Algorithm

Feature selection with high-dimensional data and a very small proportion...

Please sign up or login with your details

Forgot password? Click here to reset