Imbalanced Data Stream Classification using Dynamic Ensemble Selection

09/17/2023
by   Priya. S, et al.
0

Modern streaming data categorization faces significant challenges from concept drift and class imbalanced data. This negatively impacts the output of the classifier, leading to improper classification. Furthermore, other factors such as the overlapping of multiple classes limit the extent of the correctness of the output. This work proposes a novel framework for integrating data pre-processing and dynamic ensemble selection, by formulating the classification framework for the nonstationary drifting imbalanced data stream, which employs the data pre-processing and dynamic ensemble selection techniques. The proposed framework was evaluated using six artificially generated data streams with differing imbalance ratios in combination with two different types of concept drifts. Each stream is composed of 200 chunks of 500 objects described by eight features and contains five concept drifts. Seven pre-processing techniques and two dynamic ensemble selection methods were considered. According to experimental results, data pre-processing combined with Dynamic Ensemble Selection techniques significantly delivers more accuracy when dealing with imbalanced data streams.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/15/2022

The Influence of Multiple Classes on Learning Online Classifiers from Imbalanced and Concept Drifting Data Streams

This work is aimed at the experimental studying the influence of local d...
research
09/26/2019

A Decision-Based Dynamic Ensemble Selection Method for Concept Drift

We propose an online method for concept driftdetection based on dynamic ...
research
04/07/2022

A survey on learning from imbalanced data streams: taxonomy, challenges, empirical study, and reproducible experimental framework

Class imbalance poses new challenges when it comes to classifying data s...
research
09/15/2021

On-the-Fly Ensemble Pruning in Evolving Data Streams

Ensemble pruning is the process of selecting a subset of componentclassi...
research
08/01/2023

Predicting Early Dropouts of an Active and Healthy Ageing App

In this work, we present a machine learning approach for predicting earl...
research
10/07/2021

A Broad Ensemble Learning System for Drifting Stream Classification

Data stream classification has become a major research topic due to the ...
research
03/29/2018

Modified SMOTE Using Mutual Information and Different Sorts of Entropies

SMOTE is one of the oversampling techniques for balancing the datasets a...

Please sign up or login with your details

Forgot password? Click here to reset