MEBoost: Mixing Estimators with Boosting for Imbalanced Data Classification

12/18/2017
by   Farshid Rayhan, et al.
0

Class imbalance problem has been a challenging research problem in the fields of machine learning and data mining as most real life datasets are imbalanced. Several existing machine learning algorithms try to maximize the accuracy classification by correctly identifying majority class samples while ignoring the minority class. However, the concept of the minority class instances usually represents a higher interest than the majority class. Recently, several cost sensitive methods, ensemble models and sampling techniques have been used in literature in order to classify imbalance datasets. In this paper, we propose MEBoost, a new boosting algorithm for imbalanced datasets. MEBoost mixes two different weak learners with boosting to improve the performance on imbalanced datasets. MEBoost is an alternative to the existing techniques such as SMOTEBoost, RUSBoost, Adaboost, etc. The performance of MEBoost has been evaluated on 12 benchmark imbalanced datasets with state of the art ensemble methods like SMOTEBoost, RUSBoost, Easy Ensemble, EUSBoost, DataBoost. Experimental results show significant improvement over the other methods and it can be concluded that MEBoost is an effective and promising algorithm to deal with imbalance datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/12/2017

CUSBoost: Cluster-based Under-sampling with Boosting for Imbalanced Classification

Class imbalance classification is a challenging research problem in data...
research
11/15/2017

LIUBoost : Locality Informed Underboosting for Imbalanced Data Classification

The problem of class imbalance along with class-overlapping has become a...
research
10/17/2019

WOTBoost: Weighted Oversampling Technique in Boosting for imbalanced learning

Machine learning classifiers often stumble over imbalanced datasets wher...
research
12/22/2020

A Survey of Methods for Managing the Classification and Solution of Data Imbalance Problem

The problem of class imbalance is extensive for focusing on numerous app...
research
09/17/2022

AdaCC: Cumulative Cost-Sensitive Boosting for Imbalanced Classification

Class imbalance poses a major challenge for machine learning as most sup...
research
01/16/2020

Smart Data based Ensemble for Imbalanced Big Data Classification

Big Data scenarios pose a new challenge to traditional data mining algor...
research
06/02/2021

Hybrid Ensemble optimized algorithm based on Genetic Programming for imbalanced data classification

One of the most significant current discussions in the field of data min...

Please sign up or login with your details

Forgot password? Click here to reset