AnomalyBench: An Open Benchmark for Explainable Anomaly Detection

10/10/2020
by   Vincent Jacob, et al.
9

Access to high-quality data repositories and benchmarks have been instrumental in advancing the state of the art in many domains, as they provide the research community a common ground for training, testing, evaluating, comparing, and experimenting with novel machine learning models. Lack of such community resources for anomaly detection (AD) severely limits progress. In this report, we present AnomalyBench, the first comprehensive benchmark for explainable AD over high-dimensional (2000+) time series data. AnomalyBench has been systematically constructed based on real data traces from  100 repeated executions of 10 large-scale stream processing jobs on a Spark cluster. 30+ of these executions were disturbed by introducing  100 instances of different types of anomalous events (e.g., misbehaving inputs, resource contention, process failures). For each of these anomaly instances, ground truth labels for the root-cause interval as well as those for the effect interval are available, providing a means for supporting both AD tasks and explanation discovery (ED) tasks via root-cause analysis. We demonstrate the key design features and practical utility of AnomalyBench through an experimental study with three state-of-the-art semi-supervised AD techniques.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/22/2022

NLP Based Anomaly Detection for Categorical Time Series

Identifying anomalies in large multi-dimensional time series is a crucia...
research
10/13/2022

A Survey on Explainable Anomaly Detection

In the past two decades, most research on anomaly detection has focused ...
research
07/03/2020

Explainable Deep One-Class Classification

Deep one-class classification variants for anomaly detection learn a map...
research
12/16/2021

The MVTec 3D-AD Dataset for Unsupervised 3D Anomaly Detection and Localization

We introduce the first comprehensive 3D dataset for the task of unsuperv...
research
11/16/2021

HiRID-ICU-Benchmark – A Comprehensive Machine Learning Benchmark on High-resolution ICU Data

The recent success of machine learning methods applied to time series co...
research
07/25/2020

Improving Robustness on Seasonality-Heavy Multivariate Time Series Anomaly Detection

Robust Anomaly Detection (AD) on time series data is a key component for...
research
05/31/2023

Quality In / Quality Out: Assessing Data quality in an Anomaly Detection Benchmark

Autonomous or self-driving networks are expected to provide a solution t...

Please sign up or login with your details

Forgot password? Click here to reset