Adaptive XGBoost for Evolving Data Streams

05/15/2020
by   Jacob Montiel, et al.
0

Boosting is an ensemble method that combines base models in a sequential manner to achieve high predictive accuracy. A popular learning algorithm based on this ensemble method is eXtreme Gradient Boosting (XGB). We present an adaptation of XGB for classification of evolving data streams. In this setting, new data arrives over time and the relationship between the class and the features may change in the process, thus exhibiting concept drift. The proposed method creates new members of the ensemble from mini-batches of data as new data becomes available. The maximum ensemble size is fixed, but learning does not stop when this size is reached because the ensemble is updated on new data to ensure consistency with the current concept. We also explore the use of concept drift detection to trigger a mechanism to update the ensemble. We test our method on real and synthetic data with concept drift and compare it against batch-incremental and instance-incremental classification methods for data streams.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/13/2020

Diverse Instances-Weighting Ensemble based on Region Drift Disagreement for Concept Drift Adaptation

Concept drift refers to changes in the distribution of underlying data a...
research
04/19/2023

Advances on Concept Drift Detection in Regression Tasks using Social Networks Theory

Mining data streams is one of the main studies in machine learning area ...
research
10/07/2021

A Broad Ensemble Learning System for Drifting Stream Classification

Data stream classification has become a major research topic due to the ...
research
05/14/2014

Active Mining of Parallel Video Streams

The practicality of a video surveillance system is adversely limited by ...
research
10/18/2017

Concept Drift Learning with Alternating Learners

Data-driven predictive analytics are in use today across a number of ind...
research
05/12/2022

Mondrian Forest for Data Stream Classification Under Memory Constraints

Supervised learning algorithms generally assume the availability of enou...
research
08/25/2022

Adaptive Learning for Service Monitoring Data

Service monitoring applications continuously produce data to monitor the...

Please sign up or login with your details

Forgot password? Click here to reset