PreGAN: Preemptive Migration Prediction Network for Proactive Fault-Tolerant Edge Computing

12/04/2021
by Shreshth Tuli, et al.

Building a fault-tolerant edge system that can quickly react to node overloads or failures is challenging due to the unreliability of edge devices and the strict service deadlines of modern applications. Moreover, unnecessary task migrations can stress the system network, giving rise to the need for a smart and parsimonious failure recovery scheme. Prior approaches often fail to adapt to highly volatile workloads or to accurately detect and diagnose faults for optimal remediation. There is thus a need for a robust and proactive fault-tolerance mechanism to meet service level objectives. In this work, we propose PreGAN, a composite AI model using a Generative Adversarial Network (GAN) to predict preemptive migration decisions for proactive fault-tolerance in containerized edge deployments. PreGAN uses co-simulations in tandem with a GAN to learn a few-shot anomaly classifier and proactively predict migration decisions for reliable computing. Extensive experiments on a Raspberry Pi-based edge environment show that PreGAN can outperform state-of-the-art baseline methods in fault detection, diagnosis and classification, thus achieving high quality of service. PreGAN accomplishes this with 5.1% more accurate fault detection, higher diagnosis scores and 23.8% lower overheads compared to the best method among the considered baselines.


