Ensuring Learning Guarantees on Concept Drift Detection with Statistical Learning Theory

06/24/2020
by Lucas Pagliosa, et al.

Concept Drift (CD) detection aims to continuously identify changes in data stream behavior, supporting researchers in the study and modeling of real-world phenomena. Motivated by the lack of learning guarantees in current CD algorithms, we take advantage of Statistical Learning Theory (SLT) to formalize the requirements needed to ensure probabilistic learning bounds, so that detected drifts reflect actual changes in the data rather than chance. As discussed throughout this paper, a set of mathematical assumptions must hold in order to rely on SLT bounds, and these assumptions are especially controversial in CD scenarios. To address this issue, we propose a methodology for satisfying those assumptions in CD scenarios and thereby ensuring learning guarantees. In addition, we assess a set of relevant, well-known CD algorithms from the literature in light of our methodology. As our main contribution, we expect this work to support researchers in designing and evaluating CD algorithms across different domains.


