Decentralized Periodic Approach for Adaptive Fault Diagnosis in Distributed Systems

12/19/2018
by   Latika Sarna, et al.
0

In this paper, Decentralized Periodic Approach for Adaptive Fault Diagnosis (DP-AFD) algorithm is proposed for fault diagnosis in distributed systems with arbitrary topology. Faulty nodes may be either unresponsive, may have either software or hardware faults. The proposed algorithm detects the faulty nodes situated in geographically distributed locations. This algorithm does not depend on a single node or leader to detect the faults in the system. However, it empowers more than one node to detect the fault-free and faulty nodes in the system. Thus, at the end of each test cycle, every fault-free node acts as a leader to diagnose faults in the system. This feature of the algorithm makes it applicable to any arbitrary network. After every test cycle of the algorithm, all the nodes have knowledge about faulty nodes and each node is tested only once. With this knowledge, there can be redistribution of load, which was earlier assigned to the faulty nodes. Also, the algorithm permits repaired node re-entry and new node entry. In a system of n nodes, the maximum number of faulty nodes can be (n-1) which is detected by DP-AFD algorithm. DP-AFD is periodic in nature which executes test cycles after regular intervals to detect the faulty nodes in the given distributed system.

READ FULL TEXT

page 13

page 14

page 15

page 16

research
12/19/2018

Fault Diagnosis for Distributed Systems using Accuracy Technique

Distributed Systems involve two or more computer systems which may be si...
research
03/23/2023

Amalgamated Intermittent Computing Systems

Intermittent computing systems undergo frequent power failure, hindering...
research
06/30/2021

A Local Diagnosis Algorithm for Hypercube-like Networks under the BGM Diagnosis Model

System diagnosis is process of identifying faulty nodes in a system. An ...
research
02/20/2018

Cobalt: BFT Governance in Open Networks

We present Cobalt, a novel atomic broadcast algorithm that works in netw...
research
09/23/2021

Fault Localization in Cloud using Centrality Measures

Fault localization is an imperative method in fault tolerance in a distr...
research
07/04/2012

Efficient Test Selection in Active Diagnosis via Entropy Approximation

We consider the problem of diagnosing faults in a system represented by ...
research
11/09/2020

Toward Fault-Tolerant Deadlock-Free Routing in HyperSurface-Embedded Controller Networks

HyperSurfaces (HSFs) consist of structurally reconfigurable metasurfaces...

Please sign up or login with your details

Forgot password? Click here to reset