Benchmarking datasets for Anomaly-based Network Intrusion Detection: KDD CUP 99 alternatives

11/13/2018
by   Abhishek Divekar, et al.
0

Machine Learning has been steadily gaining traction for its use in Anomaly-based Network Intrusion Detection Systems (A-NIDS). Research into this domain is frequently performed using the KDD CUP 99 dataset as a benchmark. Several studies question its usability while constructing a contemporary NIDS, due to the skewed response distribution, non-stationarity, and failure to incorporate modern attacks. In this paper, we compare the performance for KDD-99 alternatives when trained using classification models commonly found in literature: Neural Network, Support Vector Machine, Decision Tree, Random Forest, Naive Bayes and K-Means. Applying the SMOTE oversampling technique and random undersampling, we create a balanced version of NSL-KDD and prove that skewed target classes in KDD-99 and NSL-KDD hamper the efficacy of classifiers on minority classes (U2R and R2L), leading to possible security risks. We explore UNSW-NB15, a modern substitute to KDD-99 with greater uniformity of pattern distribution. We benchmark this dataset before and after SMOTE oversampling to observe the effect on minority performance. Our results indicate that classifiers trained on UNSW-NB15 match or better the Weighted F1-Score of those trained on NSL-KDD and KDD-99 in the binary case, thus advocating UNSW-NB15 as a modern substitute to these datasets.

READ FULL TEXT

page 1

page 4

research
05/07/2018

Improving Network Intrusion Detection Classifiers by Non-payload-Based Exploit-Independent Obfuscations: An Adversarial Approach

Machine-learning based intrusion detection classifiers are able to detec...
research
05/10/2021

ADASYN-Random Forest Based Intrusion Detection Model

Intrusion detection has been a key topic in the field of cyber security,...
research
07/07/2022

Bayesian Hyperparameter Optimization for Deep Neural Network-Based Network Intrusion Detection

Traditional network intrusion detection approaches encounter feasibility...
research
05/14/2021

Anomaly Detection in Cybersecurity: Unsupervised, Graph-Based and Supervised Learning Methods in Adversarial Environments

Machine learning for anomaly detection has become a widely researched fi...
research
03/18/2018

Towards an Efficient Anomaly-Based Intrusion Detection for Software-Defined Networks

Software-defined networking (SDN) is a new paradigm that allows developi...
research
04/18/2019

Intrusion Detection Mechanism Using Fuzzy Rule Interpolation

Fuzzy Rule Interpolation (FRI) methods can serve deducible (interpolated...
research
03/13/2021

Image Classifiers for Network Intrusions

This research recasts the network attack dataset from UNSW-NB15 as an in...

Please sign up or login with your details

Forgot password? Click here to reset