ClusterLog: Clustering Logs for Effective Log-based Anomaly Detection

01/19/2023
by Chris Egersdoerfer, et al.

As scalable file systems become more prevalent in High Performance Computing (HPC), accurate anomaly detection on runtime logs is becoming more important. However, many state-of-the-art methods for log-based anomaly detection, such as DeepLog, encounter numerous challenges when applied to logs from many parallel file systems (PFSes), often because of the irregularity and ambiguity of their time-based log sequences. To circumvent these problems, this study proposes ClusterLog, a log pre-processing method that clusters the temporal sequence of log keys based on their semantic similarity. By grouping semantically and sentimentally similar logs, this approach aims to represent log sequences with the fewest unique log keys possible, with the intent of improving the ability of a downstream sequence-based model to learn the log patterns. Preliminary results indicate that ClusterLog not only reduces the granularity of log sequences without losing important sequence information but also generalizes to logs from different file systems.
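
The pre-processing idea described above can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the embedding or clustering algorithm, so this example assumes a toy bag-of-words vector in place of a semantic and sentiment embedding, and a simple greedy threshold clustering. The log templates, the key names (`k1` to `k4`), the function names, and the `threshold` value are all hypothetical.

```python
import math
from collections import Counter


def embed(template):
    """Toy stand-in for a semantic embedding: bag-of-words token counts (assumption)."""
    return Counter(template.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse token-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def cluster_log_keys(templates, threshold=0.5):
    """Greedily assign each log key to the first cluster whose
    representative is at least `threshold` similar; otherwise open a new cluster."""
    representatives = []  # embedding of the first member of each cluster
    assignment = {}       # log key -> cluster id
    for key, text in templates.items():
        vec = embed(text)
        for cluster_id, rep in enumerate(representatives):
            if cosine(vec, rep) >= threshold:
                assignment[key] = cluster_id
                break
        else:
            assignment[key] = len(representatives)
            representatives.append(vec)
    return assignment


def compress_sequence(sequence, assignment):
    """Rewrite a temporal sequence of log keys as a sequence of cluster ids."""
    return [assignment[key] for key in sequence]


# Hypothetical log key templates, invented for illustration only.
templates = {
    "k1": "connection to storage server lost",
    "k2": "lost connection to storage server",
    "k3": "disk quota exceeded for user",
    "k4": "user disk quota exceeded",
}

assignment = cluster_log_keys(templates)
sequence = ["k1", "k3", "k2", "k4", "k1"]
clustered = compress_sequence(sequence, assignment)
print(clustered)  # [0, 1, 0, 1, 0]
```

In this sketch, four distinct log keys collapse into two cluster ids while the order of events is preserved, so a downstream sequence model would see a smaller vocabulary over the same temporal structure. A bag-of-words vector cannot separate templates that share most tokens but differ in sentiment, which is one reason a real pipeline would need a richer embedding.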

