Towards an Ensemble Regressor Model for Anomalous ISP Traffic Prediction

05/03/2022
by   Sajal Saha, et al.
0

Prediction of network traffic behavior is significant for the effective management of modern telecommunication networks. However, the intuitive approach of predicting network traffic using administrative experience and market analysis data is inadequate for an efficient forecast framework. As a result, many different mathematical models have been studied to capture the general trend of the network traffic and predict accordingly. But the comprehensive performance analysis of varying regression models and their ensemble has not been studied before for analyzing real-world anomalous traffic. In this paper, several regression models such as Extra Gradient Boost (XGBoost), Light Gradient Boosting Machine (LightGBM), Stochastic Gradient Descent (SGD), Gradient Boosting Regressor (GBR), and CatBoost Regressor were analyzed to predict real traffic without and with outliers and show the significance of outlier detection in real-world traffic prediction. Also, we showed the outperformance of the ensemble regression model over the individual prediction model. We compared the performance of different regression models based on five different feature sets of lengths 6, 9, 12, 15, and 18. Our ensemble regression model achieved the minimum average gap of 5.04 actual and predicted traffic with nine outlier-adjusted inputs. In general, our experimental results indicate that the outliers in the data can significantly impact the quality of the prediction. Thus, outlier detection and mitigation assist the regression model in learning the general trend and making better predictions.

READ FULL TEXT
research
05/03/2022

Deep Sequence Modeling for Anomalous ISP Traffic Prediction

Internet traffic in the real world is susceptible to various external an...
research
05/09/2022

Wavelet-Based Hybrid Machine Learning Model for Out-of-distribution Internet Traffic Prediction

Efficient prediction of internet traffic is essential for ensuring proac...
research
05/29/2019

Arterial incident duration prediction using a bi-level framework of extreme gradient-tree boosting

Predicting traffic incident duration is a major challenge for many traff...
research
12/18/2018

A residual for outlier identification in zero adjusted regression models

Zero adjusted regression models are used to fit variables that are discr...
research
06/06/2023

DEK-Forecaster: A Novel Deep Learning Model Integrated with EMD-KNN for Traffic Prediction

Internet traffic volume estimation has a significant impact on the busin...
research
05/03/2022

An Empirical Study on Internet Traffic Prediction Using Statistical Rolling Model

Real-world IP network traffic is susceptible to external and internal fa...
research
03/17/2020

Improving predictions by nonlinear regression models from outlying input data

When applying machine learning/statistical methods to the environmental ...

Please sign up or login with your details

Forgot password? Click here to reset