Concept Drift Detection and Adaptation with Weak Supervision on Streaming Unlabeled Data

10/02/2019
by   Abhijit Suprem, et al.
0

Concept drift in learning and classification occurs when the statistical properties of either the data features or target change over time; evidence of drift has appeared in search data, medical research, malware, web data, and video. Drift adaptation has not yet been addressed in high dimensional, noisy, low-context data such as streaming text, video, or images due to the unique challenges these domains present. We present a two-fold approach to deal with concept drift in these domains: a density-based clustering approach to deal with virtual concept drift (change in statistical properties of features) and a weak-supervision step to deal with real concept drift (change in statistical properties of target). Our density-based clustering avoids problems posed by the curse of dimensionality to create an evolving 'map' of the live data space, thereby addressing virtual drift in features. Our weak-supervision step leverages high-confidence labels (oracle or heuristic labels) to generate weighted training sets to generalize and update existing deep learners to adapt to changing decision boundaries (real drift) and create new deep learners for unseen regions of the data space. Our results show that our two-fold approach performs well with >90 in 2014, without any human intervention.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/25/2019

Online Semi-Supervised Concept Drift Detection with Density Estimation

Concept drift is formally defined as the change in joint distribution of...
research
10/22/2010

Learning under Concept Drift: an Overview

Concept drift refers to a non stationary learning problem over time. The...
research
11/07/2020

Enhash: A Fast Streaming Algorithm For Concept Drift Detection

We propose Enhash, a fast ensemble learner that detects concept drift in...
research
03/09/2017

Information Extraction in Illicit Domains

Extracting useful entities and attribute values from illicit domains suc...
research
07/25/2017

Concept Drift Detection and Adaptation with Hierarchical Hypothesis Testing

In a streaming environment, there is often a need for statistical predic...
research
06/02/2023

An Adaptive Method for Weak Supervision with Drifting Data

We introduce an adaptive method with formal quality guarantees for weak ...

Please sign up or login with your details

Forgot password? Click here to reset