DriftSurf: A Risk-competitive Learning Algorithm under Concept Drift

03/13/2020
by   Ashraf Tahmasbi, et al.
0

When learning from streaming data, a change in the data distribution, also known as concept drift, can render a previously-learned model inaccurate and require training a new model. We present an adaptive learning algorithm that extends previous drift-detection-based methods by incorporating drift detection into a broader stable-state/reactive-state process. The advantage of our approach is that we can use aggressive drift detection in the stable state to achieve a high detection rate, but mitigate the false positive rate of standalone drift detection via a reactive state that reacts quickly to true drifts while eliminating most false positives. The algorithm is generic in its base learner and can be applied across a variety of supervised learning problems. Our theoretical analysis shows that the risk of the algorithm is competitive to an algorithm with oracle knowledge of when (abrupt) drifts occur. Experiments on synthetic and real datasets with concept drifts confirm our theoretical analysis.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/16/2022

Class Distribution Monitoring for Concept Drift Detection

We introduce Class Distribution Monitoring (CDM), an effective concept-d...
research
12/25/2012

Exponentially Weighted Moving Average Charts for Detecting Concept Drift

Classifying streaming data requires the development of methods which are...
research
04/24/2020

Concept Drift Detection via Equal Intensity k-means Space Partitioning

Data stream poses additional challenges to statistical classification ta...
research
09/25/2019

Online Semi-Supervised Concept Drift Detection with Density Estimation

Concept drift is formally defined as the change in joint distribution of...
research
08/09/2020

Concept Drift Detection: Dealing with MissingValues via Fuzzy Distance Estimations

In data streams, the data distribution of arriving observations at diffe...
research
05/03/2023

An Adaptive Algorithm for Learning with Unknown Distribution Drift

We develop and analyze a general technique for learning with an unknown ...
research
01/29/2018

On the Inter-relationships among Drift rate, Forgetting rate, Bias/variance profile and Error

We propose two general and falsifiable hypotheses about expectations on ...

Please sign up or login with your details

Forgot password? Click here to reset