Effective Removal of Operational Log Messages: an Application to Model Inference

04/15/2020
by   Donghwan Shin, et al.
0

Model inference aims to extract accurate models from the execution logs of software systems. However, in reality, logs may contain some "noise" that could deteriorate the performance of model inference. One form of noise can commonly be found in system logs that contain not only transactional messages—logging the functional behavior of the system—but also operational messages—recording the operational state of the system (e.g., a periodic heartbeat to keep track of the memory usage). In low-quality logs, transactional and operational messages are randomly interleaved, leading to the erroneous inclusion of operational behaviors into a system model, that ideally should only reflect the functional behavior of the system. It is therefore important to remove operational messages in the logs before inferring models. In this paper, we propose LogCleaner, a novel technique for removing operational logs messages. LogCleaner first performs a periodicity analysis to filter out periodic messages, and then it performs a dependency analysis to calculate the degree of dependency for all log messages and to remove operational messages based on their dependencies. The experimental results on two proprietary and 11 publicly available log datasets show that LogCleaner, on average, can accurately remove 98 operational messages and preserve 81 Furthermore, using logs pre-processed with LogCleaner decreases the execution time of model inference (with a speed-up ranging from 1.5 to 946.7 depending on the characteristics of the system) and significantly improves the accuracy of the inferred models, by increasing their ability to accept correct system behaviors (+43.8 pp on average, with pp=percentage points) and to reject incorrect system behaviors (+15.0 pp on average).

READ FULL TEXT
research
06/03/2021

PRINS: Scalable Model Inference for Component-based System Logs

Behavioral software models play a key role in many software engineering ...
research
08/06/2019

Scalable Inference of System-level Models from Component Logs

Behavioral software models play a key role in many software engineering ...
research
08/17/2023

Are They All Good? Studying Practitioners' Expectations on the Readability of Log Messages

Developers write logging statements to generate logs that provide run-ti...
research
07/04/2018

Mining Periodic Patterns with a MDL Criterion

The quantity of event logs available is increasing rapidly, be they prod...
research
02/14/2022

vue4logs – Automatic Structuring of Heterogeneous Computer System Logs

Computer system log data is commonly used in system monitoring, performa...
research
07/17/2023

2P-BFT-Log: 2-Phase Single-Author Append-Only Log for Adversarial Environments

Replicated append-only logs sequentially order messages from the same au...
research
08/24/2023

Uncertainty and Explainable Analysis of Machine Learning Model for Reconstruction of Sonic Slowness Logs

Logs are valuable information for oil and gas fields as they help to det...

Please sign up or login with your details

Forgot password? Click here to reset