Log-based Anomaly Detection Without Log Parsing

08/04/2021
by   Van-Hoang Le, et al.
0

Software systems often record important runtime information in system logs for troubleshooting purposes. There have been many studies that use log data to construct machine learning models for detecting system anomalies. Through our empirical study, we find that existing log-based anomaly detection approaches are significantly affected by log parsing errors that are introduced by 1) OOV (out-of-vocabulary) words, and 2) semantic misunderstandings. The log parsing errors could cause the loss of important information for anomaly detection. To address the limitations of existing methods, we propose NeuralLog, a novel log-based anomaly detection approach that does not require log parsing. NeuralLog extracts the semantic meaning of raw log messages and represents them as semantic vectors. These representation vectors are then used to detect anomalies through a Transformer-based classification model, which can capture the contextual information from log sequences. Our experimental results show that the proposed approach can effectively understand the semantic meaning of log messages and achieve accurate anomaly detection results. Overall, NeuralLog achieves F1-scores greater than 0.95 on four public datasets, outperforming the existing approaches.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/25/2023

Impact of Log Parsing on Log-based Anomaly Detection

Software systems log massive amounts of data, recording important runtim...
research
02/23/2021

Robust and Transferable Anomaly Detection in Log Data using Pre-Trained Language Models

Anomalies or failures in large computer systems, such as the cloud, have...
research
08/20/2019

CBOWRA: A Representation Learning Approach for Medication Anomaly Detection

Electronic health record is an important source for clinical researches ...
research
07/08/2022

Encoding NetFlows for State-Machine Learning

NetFlow data is a well-known network log format used by many network ana...
research
03/17/2020

Self-Supervised Log Parsing

Logs are extensively used during the development and maintenance of soft...
research
08/18/2023

AutoLog: A Log Sequence Synthesis Framework for Anomaly Detection

The rapid progress of modern computing systems has led to a growing inte...
research
06/22/2022

Heterogeneous Anomaly Detection for Software Systems via Attentive Multi-modal Learning

Prompt and accurate detection of system anomalies is essential to ensure...

Please sign up or login with your details

Forgot password? Click here to reset