LAnoBERT : System Log Anomaly Detection based on BERT Masked Language Model

11/18/2021
by   Yukyung Lee, et al.
14

The system log generated in a computer system refers to large-scale data that are collected simultaneously and used as the basic data for determining simple errors and detecting external adversarial intrusion or the abnormal behaviors of insiders. The aim of system log anomaly detection is to promptly identify anomalies while minimizing human intervention, which is a critical problem in the industry. Previous studies performed anomaly detection through algorithms after converting various forms of log data into a standardized template using a parser. These methods involved generating a template for refining the log key. Particularly, a template corresponding to a specific event should be defined in advance for all the log data using which the information within the log key may get lost.In this study, we propose LAnoBERT, a parser free system log anomaly detection method that uses the BERT model, exhibiting excellent natural language processing performance. The proposed method, LAnoBERT, learns the model through masked language modeling, which is a BERT-based pre-training method, and proceeds with unsupervised learning-based anomaly detection using the masked language modeling loss function per log key word during the inference process. LAnoBERT achieved better performance compared to previous methodology in an experiment conducted using benchmark log datasets, HDFS, and BGL, and also compared to certain supervised learning-based models.

READ FULL TEXT
research
09/03/2023

LogGPT: Exploring ChatGPT for Log-Based Anomaly Detection

The increasing volume of log data produced by software-intensive systems...
research
12/21/2022

LogAnMeta: Log Anomaly Detection Using Meta Learning

Modern telecom systems are monitored with performance and system logs fr...
research
07/05/2019

A Pvalue-guided Anomaly Detection Approach Combining Multiple Heterogeneous Log Parser Algorithms on IIoT Systems

Industrial Internet of Things (IIoT) is becoming an attack target of adv...
research
02/23/2021

Robust and Transferable Anomaly Detection in Log Data using Pre-Trained Language Models

Anomalies or failures in large computer systems, such as the cloud, have...
research
08/24/2019

A framework for anomaly detection using language modeling, and its applications to finance

In the finance sector, studies focused on anomaly detection are often as...
research
07/02/2021

A Collective Anomaly Detection Method Over Bitcoin Network

The popularity and amazing attractiveness of cryptocurrencies, and espec...
research
12/02/2017

Recurrent Neural Network Language Models for Open Vocabulary Event-Level Cyber Anomaly Detection

Automated analysis methods are crucial aids for monitoring and defending...

Please sign up or login with your details

Forgot password? Click here to reset