Linnaeus: A highly reusable and adaptable ML based log classification pipeline

03/11/2021
by Armin Catovic, et al.

Logs are a common way to record detailed run-time information in software. As modern software systems evolve in scale and complexity, logs have become indispensable for understanding the internal states of the system. At the same time, however, manually inspecting logs has become impractical. Recently, there has been more emphasis on statistical and machine learning (ML) based methods for analyzing logs. While the results have shown promise, most of the literature focuses on algorithms and state-of-the-art (SOTA) results, while largely ignoring the practical aspects. In this paper we demonstrate our end-to-end log classification pipeline, Linnaeus. Besides showing the more traditional ML flow, we also demonstrate our solutions for adaptability and re-use, integration into large-scale software development processes, and how we cope with a lack of labelled data. We hope Linnaeus can serve as a blueprint for, and inspire the integration of, various ML-based solutions in other large-scale industrial settings.


