DistAD: Software Anomaly Detection Based on Execution Trace Distribution

02/28/2022
by   Shiyi Kong, et al.
0

Modern software systems have become increasingly complex, which makes them difficult to test and validate. Detecting software partial anomalies in complex systems at runtime can assist with handling unintended software behaviors, avoiding catastrophic software failures and improving software runtime availability. These detection techniques aim to identify the manifestation of faults (anomalies) before they ultimately lead to unavoidable failures, thus, supporting the following runtime fault-tolerant techniques. In this work, we propose a novel anomaly detection method named DistAD, which is based on the distribution of software runtime dynamic execution traces. Unlike other existing works using key performance indicators, the execution trace is collected during runtime via intrusive instrumentation. Instrumentation are controlled following a sampling mechanism to avoid excessive overheads. Bi-directional Long Short-Term Memory (Bi-LSTM), an architecture of Recurrent Neural Network (RNN) is used to achieve the anomaly detection. The whole framework is constructed under a One-Class Neural Network (OCNN) learning mode which can help eliminate the limits of lacking for enough labeled samples and the data imbalance issues. A series of controlled experiments are conducted on a widely used database system named Cassandra to prove the validity and feasibility of the proposed method. Overheads brought about by the intrusive probing are also evaluated. The results show that DistAD can achieve more than 70 overheads compared with unmonitored executions.

READ FULL TEXT

page 5

page 7

page 8

page 9

page 11

page 13

page 14

research
10/13/2021

Detection Software Content Failures Using Dynamic Execution Information

Modern software systems become too complex to be tested and validated. D...
research
05/28/2021

A Survey on Anomaly Detection for Technical Systems using LSTM Networks

Anomalies represent deviations from the intended system operation and ca...
research
03/17/2023

A Bi-LSTM Autoencoder Framework for Anomaly Detection – A Case Study of a Wind Power Dataset

Anomalies refer to data points or events that deviate from normal and ho...
research
02/13/2018

Detecting Spacecraft Anomalies Using LSTMs and Nonparametric Dynamic Thresholding

As spacecraft send back increasing amounts of telemetry data, improved a...
research
12/13/2021

Challenges and Solutions to Build a Data Pipeline to Identify Anomalies in Enterprise System Performance

We discuss how VMware is solving the following challenges to harness dat...
research
04/10/2019

Deep Learning for System Trace Restoration

Most real-world datasets, and particularly those collected from physical...
research
08/02/2020

Looking in the Right place for Anomalies: Explainable AI through Automatic Location Learning

Deep learning has now become the de facto approach to the recognition of...

Please sign up or login with your details

Forgot password? Click here to reset