Oversampling Log Messages Using a Sequence Generative Adversarial Network for Anomaly Detection and Classification

12/09/2019
by   Amir Farzad, et al.
1

Dealing with imbalanced data is one the main challenges in machine/deep learning algorithms for classification. This issue is more important with log message data as it is typically imbalanced and negative logs are rare. In this paper, a model is proposed to generate text log messages using a SeqGAN network. Then features are extracted using an Autoencoder and anomaly detection and classification is done using a GRU network. The proposed model is evaluated with two imbalanced log data sets, namely BGL and Openstack. Results are presented which show that oversampling and balancing data increases the accuracy of anomaly detection and classification.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset