Augmentation Scheme for Dealing with Imbalanced Network Traffic Classification Using Deep Learning

01/01/2019
by   Ramin Hasibi, et al.
0

One of the most important tasks in network management is identifying different types of traffic flows. As a result, a type of management service, called Network Traffic Classifier (NTC), has been introduced. One type of NTCs that has gained huge attention in recent years applies deep learning on packets in order to classify flows. Internet is an imbalanced environment i.e., some classes of applications are a lot more populated than others e.g., HTTP. Additionally, one of the challenges in deep learning methods is that they do not perform well in imbalanced environments in terms of evaluation metrics such as precision, recall, and F_1 measure. In order to solve this problem, we recommend the use of augmentation methods to balance the dataset. In this paper, we propose a novel data augmentation approach based on the use of Long Short Term Memory (LSTM) networks for generating traffic flow patterns and Kernel Density Estimation (KDE) for replicating the numerical features of each class. First, we use the LSTM network in order to learn and generate the sequence of packets in a flow for classes with less population. Then, we complete the features of the sequence with generating random values based on the distribution of a certain feature, which will be estimated using KDE. Finally, we compare the training of a Convolutional Recurrent Neural Network (CRNN) in large-scale imbalanced, sampled, and augmented datasets. The contribution of our augmentation scheme is then evaluated on all of the datasets through measurements of precision, recall, and F1 measure for every class of application. The results demonstrate that our scheme is well suited for network traffic flow datasets and improves the performance of deep learning algorithms when it comes to above-mentioned metrics.

READ FULL TEXT

page 1

page 5

page 6

research
01/07/2018

Deep Bidirectional and Unidirectional LSTM Recurrent Neural Network for Network-wide Traffic Speed Prediction

Short-term traffic forecasting based on deep learning methods, especiall...
research
05/09/2018

Sequence Aggregation Rules for Anomaly Detection in Computer Network Traffic

We evaluate methods for applying unsupervised anomaly detection to cyber...
research
05/09/2022

Transfer Learning Based Efficient Traffic Prediction with Limited Training Data

Efficient prediction of internet traffic is an essential part of Self Or...
research
07/13/2022

Efficient Augmentation for Imbalanced Deep Learning

Deep learning models memorize training data, which hurts their ability t...
research
07/10/2021

Practical and Configurable Network Traffic Classification Using Probabilistic Machine Learning

Network traffic classification that is widely applicable and highly accu...
research
08/03/2021

Automatic classification of eclipsing binary stars using deep learning methods

In the last couple of decades, tremendous progress has been achieved in ...
research
01/10/2020

Classification of Traffic Using Neural Networks by Rejecting: a Novel Approach in Classifying VPN Traffic

Traffic flows are set of packets transferring between a client and a ser...

Please sign up or login with your details

Forgot password? Click here to reset