Wavelet-Based Hybrid Machine Learning Model for Out-of-distribution Internet Traffic Prediction

05/09/2022
by Sajal Saha, et al.

Efficient prediction of internet traffic is essential for proactive management of computer networks. Machine learning approaches now show promising performance in modeling complex real-world traffic. However, most existing works assume that the training and evaluation data come from an identical distribution. In practice, there is a high probability that a deployed model will encounter data from a slightly or entirely unknown distribution. This paper investigates and evaluates the performance of eXtreme Gradient Boosting, Light Gradient Boosting Machine, Stochastic Gradient Descent, Gradient Boosting Regressor, CatBoost Regressor, and their stacked ensemble model on both identically distributed and out-of-distribution data. Because the standalone models did not generalize well, we also propose a hybrid machine learning model that integrates wavelet decomposition to improve out-of-distribution prediction. In our experiments, the standalone ensemble model performed best, with an accuracy of 96.4% on in-distribution data, but its performance dropped significantly when tested on three different datasets whose distributions are shifted relative to the training set. Our proposed hybrid model considerably reduces the performance gap between identical and out-of-distribution evaluation compared with the standalone model, indicating that the decomposition technique is effective for out-of-distribution generalization.
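
As a rough illustration of the decomposition idea, the sketch below splits a short traffic series into a smooth component and a fluctuation component using a single-level Haar wavelet transform. The abstract does not say which wavelet family, how many decomposition levels, or which per-component models the authors used, so the Haar choice, the function names (`haar_dwt`, `haar_idwt`, `split_components`), and the traffic values are all assumptions made for illustration only.

```python
import math


def haar_dwt(signal):
    """Single-level Haar transform (assumed wavelet; not stated in the abstract).

    Returns (approximation, detail) coefficient lists, each half the input length.
    """
    if len(signal) % 2 != 0:
        raise ValueError("signal length must be even")
    s = math.sqrt(2.0)
    approx = [(signal[i] + signal[i + 1]) / s for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) / s for i in range(0, len(signal), 2)]
    return approx, detail


def haar_idwt(approx, detail):
    """Inverse of haar_dwt: rebuilds the full-length series from coefficients."""
    s = math.sqrt(2.0)
    out = []
    for a, d in zip(approx, detail):
        out.append((a + d) / s)
        out.append((a - d) / s)
    return out


def split_components(signal):
    """Split a series into two full-length additive components.

    Each component is reconstructed from one coefficient set with the other
    set zeroed, so trend + fluctuation equals the original series.
    """
    approx, detail = haar_dwt(signal)
    zeros = [0.0] * len(approx)
    trend = haar_idwt(approx, zeros)        # pairwise means (smooth part)
    fluctuation = haar_idwt(zeros, detail)  # pairwise deviations (noisy part)
    return trend, fluctuation


# Made-up traffic volumes, purely illustrative.
traffic = [120.0, 132.0, 128.0, 150.0, 170.0, 165.0, 140.0, 138.0]
trend, fluctuation = split_components(traffic)
recombined = [t + f for t, f in zip(trend, fluctuation)]
```

In a hybrid setup of this kind, each component would typically be forecast by its own regressor and the forecasts summed. Whether the paper combines components this way is not stated in the abstract.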

research
05/03/2022

Towards an Ensemble Regressor Model for Anomalous ISP Traffic Prediction

Prediction of network traffic behavior is significant for the effective ...
research
04/15/2022

Accurate ADMET Prediction with XGBoost

The absorption, distribution, metabolism, excretion, and toxicity (ADMET...
research
05/07/2018

Wavelet Decomposition of Gradient Boosting

In this paper we introduce a significant improvement to the popular tree...
research
06/18/2020

Uncertainty in Gradient Boosting via Ensembles

Gradient boosting is a powerful machine learning technique that is parti...
research
07/01/2021

Ensemble Learning-Based Approach for Improving Generalization Capability of Machine Reading Comprehension Systems

Machine Reading Comprehension (MRC) is an active field in natural langua...
research
09/28/2022

On the Robustness of Ensemble-Based Machine Learning Against Data Poisoning

Machine learning is becoming ubiquitous. From financial to medicine, mac...
research
05/13/2021

Extending Models Via Gradient Boosting: An Application to Mendelian Models

Improving existing widely-adopted prediction models is often a more effi...
