Constructing Large-Scale Real-World Benchmark Datasets for AIOps

08/08/2022
by   Zeyan Li, et al.
0

Recently, AIOps (Artificial Intelligence for IT Operations) has been well studied in academia and industry to enable automated and effective software service management. Plenty of efforts have been dedicated to AIOps, including anomaly detection, root cause localization, incident management, etc. However, most existing works are evaluated on private datasets, so their generality and real performance cannot be guaranteed. The lack of public large-scale real-world datasets has prevented researchers and engineers from enhancing the development of AIOps. To tackle this dilemma, in this work, we introduce three public real-world, large-scale datasets about AIOps, mainly aiming at KPI anomaly detection, root cause localization on multi-dimensional data, and failure discovery and diagnosis. More importantly, we held three competitions in 2018/2019/2020 based on these datasets, attracting thousands of teams to participate. In the future, we will continue to publish more datasets and hold competitions to promote the development of AIOps further.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/08/2021

ZYELL-NCTU NetTraffic-1.0: A Large-Scale Dataset for Real-World Network Anomaly Detection

Network security has been an active research topic for long. One critica...
research
09/20/2020

Out-Of-Bag Anomaly Detection

Data anomalies are ubiquitous in real world datasets, and can have an ad...
research
02/10/2023

Eadro: An End-to-End Troubleshooting Framework for Microservices on Multi-source Data

The complexity and dynamism of microservices pose significant challenges...
research
05/05/2023

Generic and Robust Root Cause Localization for Multi-Dimensional Data in Online Service Systems

Localizing root causes for multi-dimensional data is critical to ensure ...
research
12/10/2021

Multimedia Datasets for Anomaly Detection: A Review

Multimedia anomaly datasets play a crucial role in automated surveillanc...
research
01/31/2023

BALANCE: Bayesian Linear Attribution for Root Cause Localization

Root Cause Analysis (RCA) plays an indispensable role in distributed dat...
research
05/20/2022

RiskLoc: Localization of Multi-dimensional Root Causes by Weighted Risk

Failures and anomalies in large-scale software systems are unavoidable i...

Please sign up or login with your details

Forgot password? Click here to reset