Concept Drift and Covariate Shift Detection Ensemble with Lagged Labels

12/08/2020
by Yiming Xu, et al.

In model serving, keeping one fixed model throughout the often life-long inference process usually hurts performance: the data distribution evolves over time, so a model trained on historical data becomes unreliable. It is therefore important to detect changes and retrain the model in time. Existing methods generally have three weaknesses: 1) they use only the classification error rate as a signal, 2) they assume ground-truth labels are available immediately after the features of a sample are received, and 3) they cannot decide what data to use for retraining when a change occurs. We address the first problem by using six different signals that capture a wide range of data characteristics, and the second by allowing lagged labels, where the label for a given set of features arrives after a delay. For the third problem, our proposed method automatically decides what data to use for retraining based on the signals. Extensive experiments on structured and unstructured data for different types of data change show that our method consistently outperforms state-of-the-art methods by a large margin.
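To make the lagged-label setting concrete, the following is a minimal sketch and not the authors' method. The abstract does not name the six signals, so the two used here (a z-score on the mean of a single feature, and a windowed error rate) are illustrative assumptions, as are the class name `LaggedDriftMonitor`, its parameters, and its thresholds. The feature-based signal updates as soon as features arrive; the error-based signal updates only when the delayed label is received.

```python
from collections import deque
from statistics import mean, pstdev


class LaggedDriftMonitor:
    """Toy monitor with two signals (hypothetical, not from the paper):
    a feature-based one that needs no labels, and an error-based one
    that is updated only when a lagged label arrives."""

    def __init__(self, reference, window=50, z_threshold=3.0, err_threshold=0.3):
        # Statistics of the single feature on the historical (training) data.
        self.ref_mean = mean(reference)
        self.ref_std = pstdev(reference) or 1e-9  # guard against zero spread
        self.window = window
        self.z_threshold = z_threshold
        self.err_threshold = err_threshold
        self.recent_x = deque(maxlen=window)    # latest feature values
        self.recent_err = deque(maxlen=window)  # latest 0/1 errors
        self.pending = {}  # sample id -> prediction still awaiting its label

    def observe_features(self, sample_id, x, prediction):
        """Called at inference time, before any label is known."""
        self.recent_x.append(x)
        self.pending[sample_id] = prediction

    def observe_label(self, sample_id, label):
        """Called later, when the lagged label for sample_id arrives."""
        prediction = self.pending.pop(sample_id)
        self.recent_err.append(int(prediction != label))

    def covariate_signal(self):
        """True if the windowed feature mean is far from the reference mean."""
        if len(self.recent_x) < self.window:
            return False
        std_err = self.ref_std / len(self.recent_x) ** 0.5
        z = abs(mean(self.recent_x) - self.ref_mean) / std_err
        return z > self.z_threshold

    def concept_signal(self):
        """True if the windowed error rate on labeled samples is too high."""
        if len(self.recent_err) < self.window:
            return False
        return mean(self.recent_err) > self.err_threshold
```

In this sketch a shift in the inputs can be flagged before any label has arrived, while the error-based signal stays silent until enough lagged labels have been received. It does not attempt the third part of the method, choosing which data to retrain on.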


