The Impact of RDMA on Agreement

05/29/2019
by   Marcos K. Aguilera, et al.

Remote Direct Memory Access (RDMA) is becoming widely available in data centers. This technology allows a process to directly read and write the memory of a remote host, with a mechanism to control access permissions. In this paper, we study the fundamental power of these capabilities. We consider the well-known problem of achieving consensus despite failures, and find that RDMA can improve the inherent trade-off in distributed computing between failure resilience and performance. Specifically, we show that RDMA allows algorithms that simultaneously achieve high resilience and high performance, while traditional algorithms had to choose one or another. With Byzantine failures, we give an algorithm that only requires n ≥ 2f_P + 1 processes (where f_P is the maximum number of faulty processes) and decides in two (network) delays in common executions. With crash failures, we give an algorithm that only requires n ≥ f_P + 1 processes and also decides in two delays. Both algorithms tolerate a minority of memory failures inherent to RDMA, and they provide safety in asynchronous systems and liveness with standard additional assumptions.
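
As a rough illustration of the resilience bounds stated above, the following Python sketch computes the smallest system sizes that satisfy them. The function names are hypothetical and do not come from the paper, and the memory bound is only one reading of "a minority of memory failures" (strictly fewer than half of the memories may fail).

```python
def min_processes(f_p: int, byzantine: bool) -> int:
    """Smallest n allowed by the abstract's bounds for f_p faulty processes.

    Byzantine failures: n >= 2*f_p + 1.
    Crash failures:     n >= f_p + 1.
    """
    if f_p < 0:
        raise ValueError("f_p must be non-negative")
    return 2 * f_p + 1 if byzantine else f_p + 1


def min_memories(f_m: int) -> int:
    """Smallest memory count for which f_m failed memories are a strict minority.

    Assumption: "a minority of memory failures" is read as f_m < m / 2.
    """
    if f_m < 0:
        raise ValueError("f_m must be non-negative")
    return 2 * f_m + 1


if __name__ == "__main__":
    for f in range(1, 4):
        print(
            f"f={f}: Byzantine n>={min_processes(f, True)}, "
            f"crash n>={min_processes(f, False)}, "
            f"memories m>={min_memories(f)}"
        )
```

For example, tolerating two Byzantine processes under these bounds needs at least five processes, while tolerating two crashed processes needs at least three.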


