EvLog: Evolving Log Analyzer for Anomalous Logs Identification

06/02/2023
by   Yintong Huo, et al.
0

Software logs record system activities, aiding maintainers in identifying the underlying causes for failures and enabling prompt mitigation actions. However, maintainers need to inspect a large volume of daily logs to identify the anomalous logs that reveal failure details for further diagnosis. Thus, how to automatically distinguish these anomalous logs from normal logs becomes a critical problem. Existing approaches alleviate the burden on software maintainers, but they are built upon an improper yet critical assumption: logging statements in the software remain unchanged. While software keeps evolving, our empirical study finds that evolving software brings three challenges: log parsing errors, evolving log events, and unstable log sequences. In this paper, we propose a novel unsupervised approach named Evolving Log analyzer (EvLog) to mitigate these challenges. We first build a multi-level representation extractor to process logs without parsing to prevent errors from the parser. The multi-level representations preserve the essential semantics of logs while leaving out insignificant changes in evolving events. EvLog then implements an anomaly discriminator with an attention mechanism to identify the anomalous logs and avoid the issue brought by the unstable sequence. EvLog has shown effectiveness in two real-world system evolution log datasets with an average F1 score of 0.955 and 0.847 in the intra-version setting and inter-version setting, respectively, which outperforms other state-of-the-art approaches by a wide margin. To our best knowledge, this is the first study on tackling anomalous logs over software evolution. We believe our work sheds new light on the impact of software evolution with the corresponding solutions for the log analysis community.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/15/2020

A Survey on Automated Log Analysis for Reliability Engineering

Logs are semi-structured text generated by logging statements in softwar...
research
04/15/2021

Automated Evolution of Feature Logging Statement Levels Using Git Histories and Degree of Interest

Logging – used for system events and security breaches to more informati...
research
08/21/2023

A Large-scale Benchmark for Log Parsing

Log data is pivotal in activities like anomaly detection and failure dia...
research
03/18/2022

Active Meta-Learner for Log Analysis

The analysis of logs is a vital activity undertaken for cyber investigat...
research
10/11/2022

Digitization of Raster Logs: A Deep Learning Approach

Raster well-log images are digital representations of well-logs data gen...
research
10/25/2017

Deep Convolutional Neural Networks for Anomaly Event Classification on Distributed Systems

The increasing popularity of server usage has brought a plenty of anomal...
research
01/08/2020

Comparing Constraints Mined From Execution Logs to Understand Software Evolution

Complex software systems evolve frequently, e.g., when introducing new f...

Please sign up or login with your details

Forgot password? Click here to reset