Recovery command generation towards automatic recovery in ICT systems by Seq2Seq learning

03/24/2020
by   Hiroki Ikeuchi, et al.
0

With the increase in scale and complexity of ICT systems, their operation increasingly requires automatic recovery from failures. Although it has become possible to automatically detect anomalies and analyze root causes of failures with current methods, making decisions on what commands should be executed to recover from failures still depends on manual operation, which is quite time-consuming. Toward automatic recovery, we propose a method of estimating recovery commands by using Seq2Seq, a neural network model. This model learns complex relationships between logs obtained from equipment and recovery commands that operators executed in the past. When a new failure occurs, our method estimates plausible commands that recover from the failure on the basis of collected logs. We conducted experiments using a synthetic dataset and realistic OpenStack dataset, demonstrating that our method can estimate recovery commands with high accuracy.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/21/2019

Turning Privacy Constraints into Syslog Analysis Advantage

The mean time between failures (MTBF) of HPC systems is rapidly reducing...
research
07/01/2019

Understanding Fault Scenarios and Impacts through Fault Injection Experiments in Cielo

We present a set of fault injection experiments performed on the ACES (L...
research
09/06/2019

Automatic Failure Recovery for End-User Programs on Service Mobile Robots

For service mobile robots to be most effective, it must be possible for ...
research
08/07/2023

Recoverable and Detectable Self-Implementations of Swap

Recoverable algorithms tolerate failures and recoveries of processes by ...
research
12/31/2020

Heterogeneous recovery from large scale power failures

Large-scale power failures are induced by nearly all natural disasters f...
research
09/19/2022

Rapid Recovery of Program Execution Under Power Failures for Embedded Systems with NVM

After power is switched on, recovering the interrupted program from the ...
research
10/21/2017

Seamless Paxos Coordinators

The Paxos algorithm requires a single correct coordinator process to ope...

Please sign up or login with your details

Forgot password? Click here to reset