Hybrid Forest: A Concept Drift Aware Data Stream Mining Algorithm

02/10/2019
by   Radin Hamidi Rad, et al.
0

Nowadays with a growing number of online controlling systems in the organization and also a high demand of monitoring and stats facilities that uses data streams to log and control their subsystems, data stream mining becomes more and more vital. Hoeffding Trees (also called Very Fast Decision Trees a.k.a. VFDT) as a Big Data approach in dealing with the data stream for classification and regression problems showed good performance in handling facing challenges and making the possibility of any-time prediction. Although these methods outperform other methods e.g. Artificial Neural Networks (ANN) and Support Vector Regression (SVR), they suffer from high latency in adapting with new concepts when the statistical distribution of incoming data changes. In this article, we introduced a new algorithm that can detect and handle concept drift phenomenon properly. This algorithms also benefits from fast startup ability which helps systems to be able to predict faster than other algorithms at the beginning of data stream arrival. We also have shown that our approach will overperform other controversial approaches for classification and regression tasks.

READ FULL TEXT
research
09/21/2020

CURIE: A Cellular Automaton for Concept Drift Detection

Data stream mining extracts information from large quantities of data fl...
research
04/20/2020

Multi-label Stream Classification with Self-Organizing Maps

Several learning algorithms have been proposed for offline multi-label c...
research
11/03/2015

Data Stream Classification using Random Feature Functions and Novel Method Combinations

Big Data streams are being generated in a faster, bigger, and more commo...
research
07/16/2019

Online Local Boosting: improving performance in online decision trees

As more data are produced each day, and faster, data stream mining is gr...
research
05/12/2022

Mondrian Forest for Data Stream Classification Under Memory Constraints

Supervised learning algorithms generally assume the availability of enou...
research
06/21/2023

An efficient and straightforward online quantization method for a data stream through remove-birth updating

The growth of network-connected devices is creating an explosion of data...

Please sign up or login with your details

Forgot password? Click here to reset