Hellinger Distance Weighted Ensemble for Imbalanced Data Stream Classification

01/30/2021
by   Joanna Grzyb, et al.
0

The imbalanced data classification remains a vital problem. The key is to find such methods that classify both the minority and majority class correctly. The paper presents the classifier ensemble for classifying binary, non-stationary and imbalanced data streams where the Hellinger Distance is used to prune the ensemble. The paper includes an experimental evaluation of the method based on the conducted experiments. The first one checks the impact of the base classifier type on the quality of the classification. In the second experiment, the Hellinger Distance Weighted Ensemble (HDWE) method is compared to selected state-of-the-art methods using a statistical test with two base classifiers. The method was profoundly tested based on many imbalanced data streams and obtained results proved the HDWE method's usefulness.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/16/2020

Diversity-Aware Weighted Majority Vote Classifier for Imbalanced Data

In this paper, we propose a diversity-aware ensemble learning based algo...
research
04/07/2022

A survey on learning from imbalanced data streams: taxonomy, challenges, empirical study, and reproducible experimental framework

Class imbalance poses new challenges when it comes to classifying data s...
research
05/09/2014

Hellinger Distance Trees for Imbalanced Streams

Classifiers trained on data sets possessing an imbalanced class distribu...
research
08/09/2019

Adaptive Ensemble of Classifiers with Regularization for Imbalanced Data Classification

Dynamic ensembling of classifiers is an effective approach in processing...
research
10/02/2016

Sparsity-driven weighted ensemble classifier

In this letter, a novel weighted ensemble classifier is proposed that im...
research
09/15/2021

On-the-Fly Ensemble Pruning in Evolving Data Streams

Ensemble pruning is the process of selecting a subset of componentclassi...
research
03/19/2017

Native Language Identification using Stacked Generalization

Ensemble methods using multiple classifiers have proven to be the most s...

Please sign up or login with your details

Forgot password? Click here to reset