OneLog: Towards End-to-End Training in Software Log Anomaly Detection

04/15/2021
by   Shayan Hashemi, et al.
0

In recent years, with the growth of online services and IoT devices, software log anomaly detection has become a significant concern for both academia and industry. However, at the time of writing this paper, almost all contributions to the log anomaly detection task, follow the same traditional architecture based on parsing, vectorizing, and classifying. This paper proposes OneLog, a new approach that uses a large deep model based on instead of multiple small components. OneLog utilizes a character-based convolutional neural network (CNN) originating from traditional NLP tasks. This allows the model to take advantage of multiple datasets at once and take advantage of numbers and punctuations, which were removed in previous architectures. We evaluate OneLog using four open data sets Hadoop Distributed File System (HDFS), BlueGene/L (BGL), Hadoop, and OpenStack. We evaluate our model with single and multi-project datasets. Additionally, we evaluate robustness with synthetically evolved datasets and ahead-of-time anomaly detection test that indicates capabilities to predict anomalies before occurring. To the best of our knowledge, our multi-project model outperforms state-of-the-art methods in HDFS, Hadoop, and BGL datasets, respectively setting getting F1 scores of 99.99, 99.99, and 99.98. However, OneLog's performance on the Openstack is unsatisfying with F1 score of only 21.18. Furthermore, Onelogs performance suffers very little from noise showing F1 scores of 99.95, 99.92, and 99.98 in HDFS, Hadoop, and BGL.

READ FULL TEXT
research
02/02/2021

Detecting Anomalies in Software Execution Logs with Siamese Network

Logs are semi-structured text files that represent software's execution ...
research
05/24/2023

Beyond Individual Input for Deep Anomaly Detection on Tabular Data

Anomaly detection is crucial in various domains, such as finance, health...
research
09/11/2021

Towards a Rigorous Evaluation of Time-series Anomaly Detection

In recent years, proposed studies on time-series anomaly detection (TAD)...
research
09/20/2021

A2Log: Attentive Augmented Log Anomaly Detection

Anomaly detection becomes increasingly important for the dependability a...
research
03/03/2018

Analyzing Business Process Anomalies Using Autoencoders

Businesses are naturally interested in detecting anomalies in their inte...
research
03/17/2019

Learning Competitive and Discriminative Reconstructions for Anomaly Detection

Most of the existing methods for anomaly detection use only positive dat...
research
04/08/2020

Comparison of Evolving Granular Classifiers applied to Anomaly Detection for Predictive Maintenance in Computing Centers

Log-based predictive maintenance of computing centers is a main concern ...

Please sign up or login with your details

Forgot password? Click here to reset