Log Parsing Evaluation in the Era of Modern Software Systems

08/17/2023
by   Stefan Petrescu, et al.
0

Due to the complexity and size of modern software systems, the amount of logs generated is tremendous. Hence, it is infeasible to manually investigate these data in a reasonable time, thereby requiring automating log analysis to derive insights about the functioning of the systems. Motivated by an industry use-case, we zoom-in on one integral part of automated log analysis, log parsing, which is the prerequisite to deriving any insights from logs. Our investigation reveals problematic aspects within the log parsing field, particularly its inefficiency in handling heterogeneous real-world logs. We show this by assessing the 14 most-recognized log parsing approaches in the literature using (i) nine publicly available datasets, (ii) one dataset comprised of combined publicly available data, and (iii) one dataset generated within the infrastructure of a large bank. Subsequently, toward improving log parsing robustness in real-world production scenarios, we propose a tool, Logchimera, that enables estimating log parsing performance in industry contexts through generating synthetic log data that resemble industry logs. Our contributions serve as a foundation to consolidate past research efforts, facilitate future research advancements, and establish a strong link between research and industry log parsing.

READ FULL TEXT
research
11/08/2018

Tools and Benchmarks for Automated Log Parsing

Logs are imperative in the development and maintenance process of many s...
research
02/12/2021

On Automatic Parsing of Log Records

Software log analysis helps to maintain the health of software solutions...
research
04/24/2023

USTEP: Structuration des logs en flux grâce à un arbre de recherche évolutif

Logs record valuable system information at runtime. They are widely used...
research
08/14/2020

Loghub: A Large Collection of System Log Datasets towards Automated Log Analytics

Logs have been widely adopted in software system development and mainten...
research
12/29/2022

System Log Parsing: A Survey

Modern information and communication systems have become increasingly ch...
research
06/06/2023

A Novel Approach To User Agent String Parsing For Vulnerability Analysis Using Mutli-Headed Attention

The increasing reliance on the internet has led to the proliferation of ...
research
12/16/2020

Summarizing Unstructured Logs in Online Services

Logs are one of the most valuable data sources for managing large-scale ...

Please sign up or login with your details

Forgot password? Click here to reset