A Comparative Study of Network Traffic Representations for Novelty Detection

06/30/2020
by   Kun Yang, et al.
0

Data representation plays a critical role in the performance of novelty detection methods from machine learning (ML). Network traffic has conventionally posed many challenges to conventional anomaly detection, due to the inherent diversity of network traffic. Even within a single network, the most fundamental characteristics can change; this variability is fundamental to network traffic but especially true in the Internet of Things (IoT), where the network hosts a wide array of devices, each of which behaves differently, exhibiting high variance in both operational modalities and network activity patterns. Although there are established ways to study the effects of data representation in supervised learning, the problem is particularly challenging and understudied in the unsupervised learning context, where there is no standard way to evaluate the effect of selected features and representations at training time. This work explores different data representations for novelty detection in the Internet of Things, studying the effect of different representations of network traffic flows on the performance of a wide range of machine learning algorithms for novelty detection for problems arising in IoT, including malware detection, the detection of rogue devices, and the detection of cyberphysical anomalies. We find that no single representation works best (in terms of area under the curve) across devices or ML methods, yet the following features consistently improve the performance of novelty detection algorithms: (1) traffic sizes, (i.e., packet sizes rather than number of packets in volume-based representations); and (2) packet header fields (i.e., TTL, TCP flags).

READ FULL TEXT

page 11

page 18

page 19

page 20

page 23

page 25

page 27

research
01/09/2023

Efficient Attack Detection in IoT Devices using Feature Engineering-Less Machine Learning

Through the generalization of deep learning, the research community has ...
research
04/22/2021

An Efficient One-Class SVM for Anomaly Detection in the Internet of Things

Insecure Internet of things (IoT) devices pose significant threats to cr...
research
09/09/2021

Detecting Attacks on IoT Devices using Featureless 1D-CNN

The generalization of deep learning has helped us, in the past, address ...
research
08/12/2020

Learning to Detect Anomalous Wireless Links in IoT Networks

After decades of research, Internet of Things (IoT) is finally permeatin...
research
02/01/2019

System Design Considerations For Internet Of Things (IoT) With Category-M Devices In LTE Networks

Successful network deployment of the Internet of Things (IoT) requires m...
research
02/04/2020

Machine Learning Methods for Monitoring of Quasi-Periodic Traffic in Massive IoT Networks

One of the central problems in massive Internet of Things (IoT) deployme...
research
10/27/2020

Beyond Accuracy: Cost-Aware Data Representation Exploration for Network Traffic Model Performance

In this paper, we explore how different representations of network traff...

Please sign up or login with your details

Forgot password? Click here to reset