Detecting Anomalies in Software Execution Logs with Siamese Network

02/02/2021
by   Shayan Hashemi, et al.
0

Logs are semi-structured text files that represent software's execution paths and states during its run-time. Therefore, detecting anomalies in software logs reflect anomalies in the software's execution path or state. So, it has become a notable concern in software engineering. We use LSTM like many prior works, and on top of LSTM, we propose a novel anomaly detection approach based on the Siamese network. This paper also provides an authentic validation of the approach on the Hadoop Distributed File System (HDFS) log dataset. To the best of our knowledge, the proposed approach outperforms other methods on the same dataset at the F1 score of 0.996, resulting in a new state-of-the-art performance on the dataset. Along with the primary method, we introduce a novel training pair generation algorithm that reduces generated training pairs by the factor of 3000 while maintaining the F1 score, merely a modest decay from 0.996 to 0.995. Additionally, we propose a hybrid model by combining the Siamese network with a traditional feedforward neural network to make end-to-end training possible, reducing engineering effort in setting up a deep-learning-based log anomaly detector. Furthermore, we examine our method's robustness to log evolutions by evaluating the model on synthetically evolved log sequences; we got the F1 score of 0.95 at the noise ratio of 20 we dive deep into some of the side benefits of the Siamese network. Accordingly, we introduce a method of monitoring the evolutions of logs without label requirements at run-time. Additionally, we present a visualization technique that facilitates human administrations of log anomaly detection.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/15/2021

OneLog: Towards End-to-End Training in Software Log Anomaly Detection

In recent years, with the growth of online services and IoT devices, sof...
research
10/11/2022

Digitization of Raster Logs: A Deep Learning Approach

Raster well-log images are digital representations of well-logs data gen...
research
03/03/2018

Analyzing Business Process Anomalies Using Autoencoders

Businesses are naturally interested in detecting anomalies in their inte...
research
02/14/2023

Heterogeneous Anomaly Detection for Software Systems via Semi-supervised Cross-modal Attention

Prompt and accurate detection of system anomalies is essential to ensure...
research
08/18/2023

AutoLog: A Log Sequence Synthesis Framework for Anomaly Detection

The rapid progress of modern computing systems has led to a growing inte...
research
02/18/2022

Pinpointing Anomaly Events in Logs from Stability Testing – N-Grams vs. Deep-Learning

As stability testing execution logs can be very long, software engineers...
research
12/23/2021

SemParser: A Semantic Parser for Log Analysis

Logs, being run-time information automatically generated by software, re...

Please sign up or login with your details

Forgot password? Click here to reset