Employing chunk size adaptation to overcome concept drift

10/25/2021
by   Jędrzej Kozal, et al.
4

Modern analytical systems must be ready to process streaming data and correctly respond to data distribution changes. The phenomenon of changes in data distributions is called concept drift, and it may harm the quality of the used models. Additionally, the possibility of concept drift appearance causes that the used algorithms must be ready for the continuous adaptation of the model to the changing data distributions. This work focuses on non-stationary data stream classification, where a classifier ensemble is used. To keep the ensemble model up to date, the new base classifiers are trained on the incoming data blocks and added to the ensemble while, at the same time, outdated models are removed from the ensemble. One of the problems with this type of model is the fast reaction to changes in data distributions. We propose a new Chunk Adaptive Restoration framework that can be adapted to any block-based data stream classification algorithm. The proposed algorithm adjusts the data chunk size in the case of concept drift detection to minimize the impact of the change on the predictive performance of the used model. The conducted experimental research, backed up with the statistical tests, has proven that Chunk Adaptive Restoration significantly reduces the model's restoration time.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/07/2019

Multi-Source Transfer Learning for Non-Stationary Environments

In data stream mining, predictive models typically suffer drops in predi...
research
12/19/2021

Active Weighted Aging Ensemble for Drifted Data Stream Classification

One of the significant problems of streaming data classification is the ...
research
01/25/2022

Bilevel Online Deep Learning in Non-stationary Environment

Recent years have witnessed enormous progress of online learning. Howeve...
research
07/10/2020

Reactive Soft Prototype Computing for Concept Drift Streams

The amount of real-time communication between agents in an information s...
research
11/07/2020

Enhash: A Fast Streaming Algorithm For Concept Drift Detection

We propose Enhash, a fast ensemble learner that detects concept drift in...
research
07/15/2019

ParaFIS:A new online fuzzy inference system based on parallel drift anticipation

This paper proposes a new architecture of incremen-tal fuzzy inference s...
research
03/18/2019

Prototype-based classifiers in the presence of concept drift: A modelling framework

We present a modelling framework for the investigation of prototype-base...

Please sign up or login with your details

Forgot password? Click here to reset