Hybrid Ensemble optimized algorithm based on Genetic Programming for imbalanced data classification

06/02/2021
by   Maliheh Roknizadeh, et al.
0

One of the most significant current discussions in the field of data mining is classifying imbalanced data. In recent years, several ways are proposed such as algorithm level (internal) approaches, data level (external) techniques, and cost-sensitive methods. Although extensive research has been carried out on imbalanced data classification, however, several unsolved challenges remain such as no attention to the importance of samples to balance, determine the appropriate number of classifiers, and no optimization of classifiers in the combination of classifiers. The purpose of this paper is to improve the efficiency of the ensemble method in the sampling of training data sets, especially in the minority class, and to determine better basic classifiers for combining classifiers than existing methods. We proposed a hybrid ensemble algorithm based on Genetic Programming (GP) for two classes of imbalanced data classification. In this study uses historical data from UCI Machine Learning Repository to assess minority classes in imbalanced datasets. The performance of our proposed algorithm is evaluated by Rapid-miner studio v.7.5. Experimental results show the performance of the proposed method on the specified data sets in the size of the training set shows 40 accuracy than other dimensions of the minority class prediction.

READ FULL TEXT

page 7

page 9

research
12/18/2017

MEBoost: Mixing Estimators with Boosting for Imbalanced Data Classification

Class imbalance problem has been a challenging research problem in the f...
research
08/09/2019

Adaptive Ensemble of Classifiers with Regularization for Imbalanced Data Classification

Dynamic ensembling of classifiers is an effective approach in processing...
research
08/27/2019

Deep Learning-Based Strategy for Macromolecules Classification with Imbalanced Data from Cellular Electron Cryotomography

Deep learning model trained by imbalanced data may not work satisfactori...
research
06/28/2016

Reviving Threshold-Moving: a Simple Plug-in Bagging Ensemble for Binary and Multiclass Imbalanced Data

Class imbalance presents a major hurdle in the application of data minin...
research
09/22/2021

Vehicle Behavior Prediction and Generalization Using Imbalanced Learning Techniques

The use of learning-based methods for vehicle behavior prediction is a p...
research
01/04/2021

A Novel Bio-Inspired Hybrid Multi-Filter Wrapper Gene Selection Method with Ensemble Classifier for Microarray Data

Microarray technology is known as one of the most important tools for co...

Please sign up or login with your details

Forgot password? Click here to reset