Scheduling and Checkpointing optimization algorithm for Byzantine fault tolerance in Cloud Clusters

02/03/2018
by Sathya Chinnathambi, et al.

Among these faults, Byzantine faults pose a serious challenge to fault tolerance mechanisms because they often go undetected at the initial stage and can easily propagate to other VMs before detection. Consequently, some mission-critical applications, such as air traffic control and online banking, still stay away from the cloud. Moreover, if a Byzantine fault is not detected and tolerated at the initial stage, applications such as big data analytics can go completely wrong despite hours of computation performed by the entire cloud. Previous work therefore proposed a fool-proof Byzantine fault detection method; as a continuation, this work designs a scheduling algorithm (WSSS) and a checkpoint optimization algorithm (TCC) to tolerate and eliminate Byzantine faults before they have any impact. The WSSS algorithm keeps track of the performance of the servers that make up the virtual clusters so that the best-performing server can be allocated to mission-critical applications. To do so, WSSS ranks the servers using a counter that monitors every Virtual Node (VN) for time and performance failures. The TCC algorithm generalizes the possible Byzantine error-prone region by monitoring delay variation, and starts new VNs from the previous checkpoint. Moreover, it can stretch the state interval for well-performing, error-free VNs in an effort to minimize the space, time, and cost overheads caused by checkpointing. The analysis is performed by plotting state transitions and by CloudSim-based simulation. The results show that TCC reduces fault tolerance overhead exponentially and that WSSS allots virtual resources effectively.
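The two ideas in the abstract, ranking servers by a failure counter and adapting the checkpoint interval to delay variation, can be illustrated with a rough sketch. This is not the authors' WSSS or TCC; the abstract does not give their details. Every name here (`ServerRecord`, `rank_servers`, `next_checkpoint_interval`), the use of standard deviation as the delay-variation measure, the threshold, the doubling factor, and the interval bounds are assumptions made for illustration only.

```python
from dataclasses import dataclass
from statistics import pstdev


@dataclass
class ServerRecord:
    """Failure counters for one server (hypothetical structure)."""
    name: str
    time_failures: int = 0         # timing failures observed on its VNs
    performance_failures: int = 0  # performance failures observed on its VNs

    @property
    def failure_count(self) -> int:
        # Assumption: both failure kinds weigh equally in the counter.
        return self.time_failures + self.performance_failures


def rank_servers(servers):
    """Order servers best-first: fewest recorded failures, ties by name."""
    return sorted(servers, key=lambda s: (s.failure_count, s.name))


def next_checkpoint_interval(current, delays, jitter_threshold,
                             factor=2.0, minimum=1.0, maximum=64.0):
    """Return (new_interval, restart_from_checkpoint).

    Assumption: delay variation is measured as the population standard
    deviation of recent delay samples. Low variation stretches the
    interval (capped at `maximum`); high variation resets it to `minimum`
    and signals that a new VN should start from the previous checkpoint.
    """
    jitter = pstdev(delays)
    if jitter <= jitter_threshold:
        return min(current * factor, maximum), False
    return minimum, True


if __name__ == "__main__":
    servers = [ServerRecord("a", 2, 1), ServerRecord("b", 0, 1), ServerRecord("c")]
    print([s.name for s in rank_servers(servers)])
    print(next_checkpoint_interval(4.0, [10.0, 10.1, 9.9], jitter_threshold=0.5))
    print(next_checkpoint_interval(4.0, [10.0, 30.0, 5.0], jitter_threshold=0.5))
```

In this sketch a scheduler would pick the first entry of `rank_servers` for a mission-critical job, and a stable VN would see its checkpoint interval grow until it reaches the cap, which is one simple way to cut checkpointing overhead for error-free nodes.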


