Rare Yet Popular: Evidence and Implications from Labeled Datasets for Network Anomaly Detection

11/18/2022
by   Jose Manuel Navarro, et al.
0

Anomaly detection research works generally propose algorithms or end-to-end systems that are designed to automatically discover outliers in a dataset or a stream. While literature abounds concerning algorithms or the definition of metrics for better evaluation, the quality of the ground truth against which they are evaluated is seldom questioned. In this paper, we present a systematic analysis of available public (and additionally our private) ground truth for anomaly detection in the context of network environments, where data is intrinsically temporal, multivariate and, in particular, exhibits spatial properties, which, to the best of our knowledge, we are the first to explore. Our analysis reveals that, while anomalies are, by definition, temporally rare events, their spatial characterization clearly shows some type of anomalies are significantly more popular than others. We find that simple clustering can reduce the need for human labeling by a factor of 2x-10x, that we are first to quantitatively analyze in the wild.

READ FULL TEXT

page 4

page 7

research
07/04/2021

A Typology of Data Anomalies

Anomalies are cases that are in some way unusual and do not appear to fi...
research
07/23/2021

HURRA! Human readable router anomaly detection

This paper presents HURRA, a system that aims to reduce the time spent b...
research
12/22/2017

Grand Challenge: Optimized Stage Processing for Anomaly Detection on Numerical Data Streams

The 2017 Grand Challenge focused on the problem of automatic detection o...
research
03/23/2021

Anomaly detection using principles of human perception

In the fields of statistics and unsupervised machine learning a fundamen...
research
05/13/2022

A Vision Inspired Neural Network for Unsupervised Anomaly Detection in Unordered Data

A fundamental problem in the field of unsupervised machine learning is t...
research
06/27/2022

Local Evaluation of Time Series Anomaly Detection Algorithms

In recent years, specific evaluation metrics for time series anomaly det...

Please sign up or login with your details

Forgot password? Click here to reset