Fault-tolerant computing with unreliable channels

We study implementations of basic fault-tolerant primitives, such as consensus and registers, in message-passing systems subject to process crashes and a broad range of communication failures. Our results characterize the necessary and sufficient conditions for implementing these primitives as a function of the connectivity constraints and synchrony assumptions. Our main contribution is a new algorithm for partially synchronous consensus that is resilient to process crashes and channel failures and is optimal in its connectivity requirements. In contrast to prior work, our algorithm assumes the most general model of message loss where faulty channels are flaky, i.e., can lose messages without any guarantee of fairness. This failure model is particularly challenging for consensus algorithms, as it rules out standard solutions based on leader oracles and failure detectors. To circumvent this limitation, we construct our solution using a new variant of the recently proposed view synchronizer abstraction, which we adapt to the crash-prone setting with flaky channels.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/30/2022

Self-stabilizing Byzantine-tolerant Broadcast

We study a well-known communication abstraction called Byzantine Reliabl...
research
04/07/2021

Self-stabilizing Multivalued Consensus in Asynchronous Crash-prone Systems

The problem of multivalued consensus is fundamental in the area of fault...
research
10/05/2018

Fault-Tolerant Consensus with an Abstract MAC Layer

In this paper, we study fault-tolerant distributed consensus in wireless...
research
11/24/2019

ACE: Abstract Consensus Encapsulation for Liveness Boosting of State Machine Replication

With the emergence of cross-organization attack-prone byzantine fault-to...
research
02/10/2020

The Istanbul BFT Consensus Algorithm

This paper presents Istanbul BFT (IBFT), a Byzantine fault-tolerant (BFT...
research
08/09/2023

Multi-Valued Connected Consensus: A New Perspective on Crusader Agreement and Adopt-Commit

Algorithms to solve fault-tolerant consensus in asynchronous systems oft...
research
08/23/2022

Null Messages, Information and Coordination

This paper investigates the transfer of information in fault-prone synch...

Please sign up or login with your details

Forgot password? Click here to reset