Decentralized Validation for Non-malicious Arbitrary Fault Tolerance in Paxos

02/18/2020
by   Rodrigo R. Barbieri, et al.
0

Fault-tolerant distributed systems offer high reliability because even if faults in their components occur, they do not exhibit erroneous behavior. Depending on the fault model adopted, hardware and software errors that do not result in a process crashing are usually not tolerated. To tolerate these rather common failures the usual solution is to adopt a stronger fault model, such as the arbitrary or Byzantine fault model. Algorithms created for this fault model, however, are considerably more complex and require more system resources than the ones developed for less strict fault models. One approach to reach a middle ground is the non-malicious arbitrary fault model. This model assumes it is possible to detect and filter faults with a given probability, if these faults are not created with malicious intent, allowing the isolation and mapping of these faults to benign faults. In this paper we describe how we incremented an implementation of active replication in the non-malicious fault model with a basic type of distributed validation, where a deviation from the expected algorithm behavior will make a process crash. We experimentally evaluate this implementation using a fault injection framework showing that it is feasible to extend the concept of non-malicious failures beyond hardware failures.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/20/2017

Hardened Paxos Through Consistency Validation

Due to the emergent adoption of distributed systems when building applic...
research
09/30/2015

Fault Tolerance in Distributed Neural Computing

With the increasing complexity of computing systems, complete hardware r...
research
05/28/2021

ARMORY: Fully Automated and Exhaustive Fault Simulation on ARM-M Binaries

Embedded systems are ubiquitous. However, physical access of users and l...
research
04/01/2004

On the Practicality of Intrinsic Reconfiguration As a Fault Recovery Method in Analog Systems

Evolvable hardware combines the powerful search capability of evolutiona...
research
10/16/2022

Towards Dynamic Fault Tolerance for Hardware-Implemented Artificial Neural Networks: A Deep Learning Approach

The functionality of electronic circuits can be seriously impaired by th...
research
10/07/2021

FaaSter Troubleshooting – Evaluating Distributed Tracing Approaches for Serverless Applications

Serverless applications can be particularly difficult to troubleshoot, a...
research
07/25/2018

Skew Adjustment Factors for Fragilities of California Box-Girder Bridges Subjected to Near-Fault and Far-Field Ground Motions

Past reconnaissance studies revealed that bridges close to active faults...

Please sign up or login with your details

Forgot password? Click here to reset