Hue: A User-Adaptive Parser for Hybrid Logs

08/14/2023
by   Junjielong Xu, et al.
0

Log parsing, which extracts log templates from semi-structured logs and produces structured logs, is the first and the most critical step in automated log analysis. While existing log parsers have achieved decent results, they suffer from two major limitations by design. First, they do not natively support hybrid logs that consist of both single-line logs and multi-line logs (Java Exception and Hadoop Counters). Second, they fall short in integrating domain knowledge in parsing, making it hard to identify ambiguous tokens in logs. This paper defines a new research problem, hybrid log parsing, as a superset of traditional log parsing tasks, and proposes Hue, the first attempt for hybrid log parsing via a user-adaptive manner. Specifically, Hue converts each log message to a sequence of special wildcards using a key casting table and determines the log types via line aggregating and pattern extracting. In addition, Hue can effectively utilize user feedback via a novel merge-reject strategy, making it possible to quickly adapt to complex and changing log templates. We evaluated Hue on three hybrid log datasets and sixteen widely-used single-line log datasets (Loghub). The results show that Hue achieves an average grouping accuracy of 0.845 on hybrid logs, which largely outperforms the best results (0.563 on average) obtained by existing parsers. Hue also exhibits SOTA performance on single-line log datasets. Furthermore, Hue has been successfully deployed in a real production environment for daily hybrid log parsing.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/07/2020

Logram: Efficient Log Parsing Using n-Gram Dictionaries

Software systems usually record important runtime information in their l...
research
07/19/2023

Prompting for Automatic Log Template Extraction

Log parsing, the initial and vital stage in automated log analysis, invo...
research
09/14/2021

GPT-2C: A GPT-2 parser for Cowrie honeypot logs

Deception technologies like honeypots produce comprehensive log reports,...
research
10/29/2021

AWSOM-LP: An Effective Log Parsing Technique Using Pattern Recognition and Frequency Analysis

Logs provide users with useful insights to help with a variety of develo...
research
06/06/2023

A Novel Approach To User Agent String Parsing For Vulnerability Analysis Using Mutli-Headed Attention

The increasing reliance on the internet has led to the proliferation of ...
research
12/23/2021

SemParser: A Semantic Parser for Log Analysis

Logs, being run-time information automatically generated by software, re...
research
02/13/2019

Delog: A Privacy Preserving Log Filtering Framework for Online Compute Platforms

In many software applications, logs serve as the only interface between ...

Please sign up or login with your details

Forgot password? Click here to reset