Active Weighted Aging Ensemble for Drifted Data Stream Classification

12/19/2021
by   Michał Woźniak, et al.
14

One of the significant problems of streaming data classification is the occurrence of concept drift, consisting of the change of probabilistic characteristics of the classification task. This phenomenon destabilizes the performance of the classification model and seriously degrades its quality. An appropriate strategy counteracting this phenomenon is required to adapt the classifier to the changing probabilistic characteristics. One of the significant problems in implementing such a solution is the access to data labels. It is usually costly, so to minimize the expenses related to this process, learning strategies based on semi-supervised learning are proposed, e.g., employing active learning methods indicating which of the incoming objects are valuable to be labeled for improving the classifier's performance. This paper proposes a novel chunk-based method for non-stationary data streams based on classifier ensemble learning and an active learning strategy considering a limited budget that can be successfully applied to any data stream classification algorithm. The proposed method has been evaluated through computer experiments using both real and generated data streams. The results confirm the high quality of the proposed algorithm over state-of-the-art methods.

READ FULL TEXT
research
10/25/2021

Employing chunk size adaptation to overcome concept drift

Modern analytical systems must be ready to process streaming data and co...
research
04/14/2022

Stream-based Active Learning with Verification Latency in Non-stationary Environments

Data stream classification is an important problem in the field of machi...
research
02/26/2020

Streaming Active Deep Forest for Evolving Data Stream Classification

In recent years, Deep Neural Networks (DNNs) have gained progressive mom...
research
12/21/2021

Mining Drifting Data Streams on a Budget: Combining Active Learning with Self-Labeling

Mining data streams poses a number of challenges, including the continuo...
research
05/14/2014

Active Mining of Parallel Video Streams

The practicality of a video surveillance system is adversely limited by ...
research
09/03/2015

Incremental Active Opinion Learning Over a Stream of Opinionated Documents

Applications that learn from opinionated documents, like tweets or produ...
research
09/20/2020

Instance exploitation for learning temporary concepts from sparsely labeled drifting data streams

Continual learning from streaming data sources becomes more and more pop...

Please sign up or login with your details

Forgot password? Click here to reset