AWSOM-LP: An Effective Log Parsing Technique Using Pattern Recognition and Frequency Analysis

10/29/2021
by   Issam Sedki, et al.
0

Logs provide users with useful insights to help with a variety of development and operations tasks. The problem is that logs are often unstructured, making their analysis a complex task. This is mainly due to the lack of guidelines and best practices for logging, combined with a large number of logging libraries at the disposal of software developers. There exist studies that aim to parse automatically large logs. The main objective is to extract templates from samples of log data that are used to recognize future logs. In this paper, we propose AWSOM-LP, a powerful log parsing and abstraction tool, which is highly accurate, stable, and efficient. AWSOM-LP is built on the idea of applying pattern recognition and frequency analysis. First, log events are organized into patterns using a simple text processing method. Frequency analysis is then applied locally to instances of the same group to identify static and dynamic content of log events. When applied to 16 log datasets of the the LogPai project, AWSOM-LP achieves an average grouping accuracy of 93.5 outperforms the accuracy of five leading log parsing tools namely, Logram, Lenma, Drain, IPLoM and AEL. Additionally, AWSOM-LP can generate more than 80 of the final log templates from 10 parse up to a million log events in an average time of 5 minutes. AWSOM-LP is available online as an open source. It can be used by practitioners and researchers to parse effectively and efficiently large log files so as to support log analysis tasks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/07/2020

Logram: Efficient Log Parsing Using n-Gram Dictionaries

Software systems usually record important runtime information in their l...
research
08/14/2023

Hue: A User-Adaptive Parser for Hybrid Logs

Log parsing, which extracts log templates from semi-structured logs and ...
research
04/22/2023

Did We Miss Something Important? Studying and Exploring Variable-Aware Log Abstraction

Due to the sheer size of software logs, developers rely on automated tec...
research
03/17/2020

Self-Supervised Log Parsing

Logs are extensively used during the development and maintenance of soft...
research
09/14/2021

GPT-2C: A GPT-2 parser for Cowrie honeypot logs

Deception technologies like honeypots produce comprehensive log reports,...
research
02/15/2022

Documentation based Semantic-Aware Log Parsing

With the recent advances of deep learning techniques, there are rapidly ...
research
12/16/2020

Summarizing Unstructured Logs in Online Services

Logs are one of the most valuable data sources for managing large-scale ...

Please sign up or login with your details

Forgot password? Click here to reset