LEAF: Navigating Concept Drift in Cellular Networks

09/07/2021
by   Shinan Liu, et al.
0

Operational networks commonly rely on machine learning models for many tasks, including detecting anomalies, inferring application performance, and forecasting demand. Yet, unfortunately, model accuracy can degrade due to concept drift, whereby the relationship between the features and the target prediction changes due to reasons ranging from software upgrades to seasonality to changes in user behavior. Mitigating concept drift is thus an essential part of operationalizing machine learning models, and yet despite its importance, concept drift has not been extensively explored in the context of networking – or regression models in general. Thus, it is not well-understood how to detect or mitigate it for many common network management tasks that currently rely on machine learning models. As we show, concept drift cannot always be mitigated by periodic retraining models using newly available data, and doing so can even degrade model accuracy. In this paper, we characterize concept drift in a large cellular network for a metropolitan area in the United States. We find that concept drift occurs across key performance indicators (KPIs), regardless of model, training set size, and time interval – thus necessitating practical approaches to detect, explain, and mitigate it. To do so, we develop Local Error Approximation of Features (LEAF). LEAF detects drift; explains features and time intervals that most contribute to drift; and mitigates drift using resampling, augmentation, or ensembling. We evaluate LEAF against industry-standard mitigations (i.e., periodic retraining) with more than three years of cellular data from Verizon. LEAF consistently outperforms periodic retraining on a variety of KPIs and models, while reducing costly retrains by an order of magnitude. Due to its effectiveness, a major cellular carrier is now integrating LEAF into its forecasting and provisioning processes.

READ FULL TEXT

page 8

page 14

research
04/01/2020

Handling Concept Drifts in Regression Problems – the Error Intersection Approach

Machine learning models are omnipresent for predictions on big data. One...
research
09/21/2020

CURIE: A Cellular Automaton for Concept Drift Detection

Data stream mining extracts information from large quantities of data fl...
research
09/07/2023

Uncovering Drift in Textual Data: An Unsupervised Method for Detecting and Mitigating Drift in Machine Learning Models

Drift in machine learning refers to the phenomenon where the statistical...
research
01/20/2023

Feature Relevance Analysis to Explain Concept Drift – A Case Study in Human Activity Recognition

This article studies how to detect and explain concept drift. Human acti...
research
02/15/2021

Unified Shapley Framework to Explain Prediction Drift

Predictions are the currency of a machine learning model, and to underst...
research
07/30/2021

Tiny Machine Learning for Concept Drift

Tiny Machine Learning (TML) is a new research area whose goal is to desi...
research
11/12/2015

Characterizing Concept Drift

Most machine learning models are static, but the world is dynamic, and i...

Please sign up or login with your details

Forgot password? Click here to reset