uBFT: Microsecond-scale BFT using Disaggregated Memory [Extended Version]

10/31/2022
by Marcos K. Aguilera, et al.

We propose uBFT, the first State Machine Replication (SMR) system to achieve microsecond-scale latency in data centers while using only 2f+1 replicas to tolerate f Byzantine failures. The Byzantine Fault Tolerance (BFT) provided by uBFT is essential, as pure crashes appear to be a mere illusion: real-life systems reportedly fail in many unexpected ways. uBFT relies on a small, non-tailored trusted computing base (disaggregated memory) and consumes a practically bounded amount of memory. uBFT is based on a novel abstraction called Consistent Tail Broadcast, which we use to prevent equivocation while bounding memory. We implement uBFT using RDMA-based disaggregated memory and obtain an end-to-end latency of as little as 10 μs. This is at least 50× faster than MinBFT, a state-of-the-art 2f+1 BFT SMR system based on Intel's SGX. We use uBFT to replicate two KV-stores (Memcached and Redis), as well as a financial order matching engine (Liquibook). These applications have low latency (up to 20 μs) and become Byzantine fault tolerant at a cost of as little as 10 μs of additional latency. The price for uBFT is a small amount of reliable disaggregated memory (less than 1 MiB), which in our prototype consists of a small number of memory servers connected through RDMA and replicated for fault tolerance.
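To make the replica-count claim concrete, the following is a minimal sketch that compares the 2f+1 replicas stated in the abstract with the 3f+1 replicas that classic BFT protocols without a trusted component need. The function name `replicas_needed` and its `trusted_component` flag are hypothetical and chosen only for illustration; they are not part of uBFT.

```python
def replicas_needed(f: int, trusted_component: bool) -> int:
    """Replicas required to tolerate f Byzantine failures.

    Assumption for illustration: a trusted component that prevents
    equivocation lowers the requirement from 3f+1 to 2f+1.
    """
    if f < 0:
        raise ValueError("f must be non-negative")
    return 2 * f + 1 if trusted_component else 3 * f + 1


if __name__ == "__main__":
    # Print the replica counts side by side for small values of f.
    for f in range(1, 4):
        with_tcb = replicas_needed(f, trusted_component=True)
        without_tcb = replicas_needed(f, trusted_component=False)
        print(f"f={f}: 2f+1={with_tcb}, 3f+1={without_tcb}")
```

Under these assumptions, each additional tolerated failure costs two extra replicas instead of three.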


research
02/14/2021

Good-case Latency of Byzantine Broadcast: a Complete Categorization

This paper explores the problem good-case latency of Byzantine fault-tol...
research
07/10/2018

DXRAM's Fault-Tolerance Mechanisms Meet High Speed I/O Devices

In-memory key-value stores provide consistent low-latency access to all ...
research
05/21/2021

Classifying Trusted Hardware via Unidirectional Communication

It is well known that Byzantine fault tolerant (BFT) consensus cannot be...
research
03/25/2020

FastPay: High-Performance Byzantine Fault Tolerant Settlement

FastPay allows a set of distributed authorities, some of which are Byzan...
research
01/13/2022

Low-latency, Scalable, DeFi with Zef

Zef was recently proposed to extend the low-latency, Byzantine-Fault Tol...
research
11/08/2019

PnyxDB: a Lightweight Leaderless Democratic Byzantine Fault Tolerant Replicated Datastore

Byzantine-Fault-Tolerant (BFT) systems are rapidly emerging as a viable ...
research
02/22/2018

NetChain: Scale-Free Sub-RTT Coordination (Extended Version)

Coordination services are a fundamental building block of modern cloud s...
