Forgetful Forests: high performance learning data structures for streaming data under concept drift

12/15/2022
by   Zhehu Yuan, et al.
0

Database research can help machine learning performance in many ways. One way is to design better data structures. This paper combines the use of incremental computation and sequential and probabilistic filtering to enable "forgetful" tree-based learning algorithms to cope with concept drift data (i.e., data whose function from input to classification changes over time). The forgetful algorithms described in this paper achieve high time performance while maintaining high quality predictions on streaming data. Specifically, the algorithms are up to 24 times faster than state-of-the-art incremental algorithms with at most a 2 faster without any loss of accuracy. This makes such structures suitable for high volume streaming applications.

READ FULL TEXT

page 17

page 18

page 19

research
04/13/2020

Learning under Concept Drift: A Review

Concept drift describes unforeseeable changes in the underlying distribu...
research
09/20/2020

Adversarial Concept Drift Detection under Poisoning Attacks for Robust Data Stream Mining

Continuous learning from streaming data is among the most challenging to...
research
04/01/2022

Concept Drift Adaptation for CTR Prediction in Online Advertising Systems

Click-through rate (CTR) prediction is a crucial task in web search, rec...
research
07/23/2020

Recursive Variable-Length State Compression for Multi-Core Software Model Checking

High-performance multi-core software typically uses concurrent data stru...
research
07/24/2019

Towards AutoML in the presence of Drift: first results

Research progress in AutoML has lead to state of the art solutions that ...
research
12/07/2020

Passive Approach for the K-means Problem on Streaming Data

Currently the amount of data produced worldwide is increasing beyond mea...
research
11/17/2017

Algorithms and Data Structures to Accelerate Network Analysis

As the sheer amount of computer generated data continues to grow exponen...

Please sign up or login with your details

Forgot password? Click here to reset