PigPaxos: Devouring the communication bottlenecks in distributed consensus

03/17/2020
by   Aleksey Charapko, et al.
0

Paxos family of protocols are employed by many cloud computing services and distributed databases due to their excellent fault-tolerance properties. Unfortunately, current Paxos deployments do not scale for more than a dozen nodes due to the communication bottleneck at the leader. PigPaxos addresses this problem by decoupling the communication from the decision-making at the leader. To this end, PigPaxos revises the communication flow in Paxos to replace direct communication between the leader and followers with a relay/aggregate based message flow. Although aggregation-based approaches have been employed in the context of weak-consistency replication protocols, PigPaxos shows how they can be effectively integrated into the strong consistency distributed consensus protocols. We implement and evaluate PigPaxos, in comparison to Paxos and EPaxos protocols under various workloads over clusters of size 5 to 25 nodes. We show that PigPaxos can provide more than 3 folds improved throughput over Paxos and EPaxos with little latency deterioration. Our experiments also show that the aggregation has negligible overhead for the latency of PigPaxos as compared to the latency of Paxos. We conjecture that PigPaxos would be useful for implementing geo-replicated distributed databases with tens of replicas distributed over many regions around the globe.

READ FULL TEXT
research
09/26/2021

BigBFT: A Multileader Byzantine Fault Tolerance Protocol for High Throughput

This paper describes BigBFT, a multi-leader Byzantine fault tolerance pr...
research
09/11/2019

Cogsworth: Byzantine View Synchronization

Many distributed protocols in the partial synchrony setting with Byzanti...
research
09/11/2019

Lumière: Byzantine View Synchronization

Many distributed protocols in the partial synchrony setting with Byzanti...
research
03/10/2022

State-Machine Replication Scalability Made Simple (Extended Version)

Consensus, state-machine replication (SMR) and total order broadcast (TO...
research
11/04/2019

Raft Consensus Algorithm: an Effective Substitute for Paxos in High Throughput P2P-based Systems

One of the significant problem in peer-to-peer databases is collision pr...
research
10/13/2020

Microsecond Consensus for Microsecond Applications

We consider the problem of making apps fault-tolerant through replicatio...
research
01/19/2020

CycLedger: A Scalable and Secure Parallel Protocol for Distributed Ledger via Sharding

Traditional public distributed ledgers have not been able to scale-out w...

Please sign up or login with your details

Forgot password? Click here to reset