Concept Drift Detection and Adaptation with Hierarchical Hypothesis Testing

07/25/2017
by Shujian Yu, et al.

In a streaming environment, statistical prediction models often need to detect and adapt to concept drifts (i.e., changes in the joint distribution of predictor and response variables) so as to mitigate deteriorating predictive performance over time. Various concept drift detection approaches have been proposed over the past decades. However, they do not perform well across different concept drift types (e.g., gradual or abrupt, recurrent or irregular) and different data stream distributions (e.g., balanced or imbalanced labels). This paper presents a novel framework that can both detect and adapt to the various concept drift types, even in the presence of imbalanced data labels. The framework applies a hierarchical set of hypothesis tests in an online fashion to detect concept drifts and employs an adaptive training strategy to significantly boost its adaptation capability. The performance of the proposed framework is compared to benchmark approaches using both simulated and real-world datasets spanning the breadth of concept drift types. The proposed approach significantly outperforms benchmark solutions in terms of precision, detection delay, and adaptability across different concepts.
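To make the idea of a hierarchical set of hypothesis tests concrete, here is a minimal sketch of a two-layer check. It is an illustration under stated assumptions, not the paper's actual tests, which the abstract does not specify. The first layer is a cheap threshold on the rise in error rate between a reference window and a recent window; the second layer confirms a flagged candidate with a permutation test. The function names, the `threshold`, `n_perm`, and `alpha` parameters, and the use of 0/1 error indicators are all hypothetical choices made for this example.

```python
import random
from statistics import mean


def layer1_flag(ref_errors, new_errors, threshold=0.1):
    """Cheap first layer (assumed rule): flag a candidate drift when the
    error rate in the recent window exceeds the reference rate by more
    than `threshold`."""
    return mean(new_errors) - mean(ref_errors) > threshold


def layer2_confirm(ref_errors, new_errors, n_perm=1000, alpha=0.05, seed=0):
    """Second layer (assumed rule): one-sided permutation test on the
    difference in mean error between the two windows."""
    rng = random.Random(seed)
    observed = mean(new_errors) - mean(ref_errors)
    pooled = list(ref_errors) + list(new_errors)
    n_ref = len(ref_errors)
    at_least_as_large = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)
        diff = mean(pooled[n_ref:]) - mean(pooled[:n_ref])
        if diff >= observed:
            at_least_as_large += 1
    # Add-one correction keeps the estimated p-value strictly positive.
    p_value = (at_least_as_large + 1) / (n_perm + 1)
    return p_value <= alpha, p_value


def detect_drift(ref_errors, new_errors, threshold=0.1, n_perm=1000, alpha=0.05):
    """Report a drift only if the first layer flags it and the second confirms it."""
    if not layer1_flag(ref_errors, new_errors, threshold):
        return False
    confirmed, _ = layer2_confirm(ref_errors, new_errors, n_perm, alpha)
    return confirmed


# Hypothetical 0/1 error indicators: 10% errors before, 50% errors after.
reference = [0] * 90 + [1] * 10
recent = [0] * 50 + [1] * 50
drift_found = detect_drift(reference, recent)
no_drift_found = detect_drift(reference, reference)
```

The layering means the costlier permutation test runs only when the cheap check raises a candidate, which is one plausible way such a hierarchy can limit false alarms in an online setting.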


research
04/04/2015

Concept Drift Detection for Streaming Data

Common statistical prediction models often require and assume stationari...
research
06/25/2018

Request-and-Reverify: Hierarchical Hypothesis Testing for Concept Drift Detection with Expensive Labels

One important assumption underlying common classification models is the ...
research
09/29/2021

Customs Fraud Detection in the Presence of Concept Drift

Capturing the changing trade pattern is critical in customs fraud detect...
research
09/25/2019

Online Semi-Supervised Concept Drift Detection with Density Estimation

Concept drift is formally defined as the change in joint distribution of...
research
10/10/2022

A Hybrid Active-Passive Approach to Imbalanced Nonstationary Data Stream Classification

In real-world applications, the process generating the data might suffer...
research
10/02/2019

Concept Drift Detection and Adaptation with Weak Supervision on Streaming Unlabeled Data

Concept drift in learning and classification occurs when the statistical...
research
03/27/2021

Human-in-the-loop Handling of Knowledge Drift

We introduce and study knowledge drift (KD), a complex form of drift tha...
