DeepAI AI Chat
Log In Sign Up

DeepSIP: A System for Predicting Service Impact of Network Failure by Temporal Multimodal CNN

by   Yoichi Matsuo, et al.
Osaka University

When a failure occurs in a network, network operators need to recognize service impact, since service impact is essential information for handling failures. In this paper, we propose Deep learning based Service Impact Prediction (DeepSIP), a system to predict the time to recovery from the failure and the loss of traffic volume due to the failure in a network element using a temporal multimodal convolutional neural network (CNN). Since the time to recovery is useful information for a service level agreement (SLA) and the loss of traffic volume is directly related to the severity of the failures, we regard these as the service impact. The service impact is challenging to predict, since a network element does not explicitly contain any information about the service impact. Thus, we aim to predict the service impact from syslog messages and traffic volume by extracting hidden information about failures. To extract useful features for prediction from syslog messages and traffic volume which are multimodal and strongly correlated, and have temporal dependencies, we use temporal multimodal CNN. We experimentally evaluated DeepSIP and DeepSIP reduced prediction error by approximately 50 with other NN-based methods with a synthetic dataset.


A Non-intrusive Failure Prediction Mechanism for Deployed Optical Networks

Failures in optical network backbone can lead to major disruption of int...

Actionable and Interpretable Fault Localization for Recurring Failures in Online Service Systems

Fault localization is challenging in an online service system due to its...

Statistical Failure Mechanism Analysis of Earthquakes Revealing Time Relationships

If we assume that earthquakes are chaotic, and influenced locally then c...

A Data Augmented Bayesian Network for Node Failure Prediction in Optical Networks

Failures in optical network backbone can cause significant interruption ...

Recovery command generation towards automatic recovery in ICT systems by Seq2Seq learning

With the increase in scale and complexity of ICT systems, their operatio...

Warranty Cost Estimation Using Bayesian Network

All multi-component product manufacturing companies face the problem of ...