Evaluation and Ranking of Replica Deployments in Geographic State Machine Replication

10/09/2021
by   Shota Numakura, et al.
0

Geographic state machine replication (SMR) is a replication method in which replicas of a service are located on multiple continents to improve the fault tolerance of a general service. Nowadays, geographic SMR is easily realized using public cloud services; SMR provides extraordinary resilience against catastrophic disasters. Previous studies have revealed that the geographic distribution of the replicas has a significant influence on the performance of the geographic SMR; however, the optimal way for a system integrator to deploy replicas remains unknown. In this paper, we propose a method to evaluate and rank replica deployments to assist a system integrator in deciding a final replica deployment. In the method, we also propose a novel evaluation function that estimates a latency of SMR protocols with round-trip time (RTT). To demonstrate the effectiveness of the proposed method, we build thousands of geographic SMRs on Amazon Web Services and present experimental results. The results show that the proposed method that estimates a latency based on RTTs can generate consistent rankings with reasonable calculation time.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/09/2021

A State Transfer Method That Adapts to Network Bandwidth Variations in Geographic State Machine Replication

We present a new state transfer method for geographic State Machine Repl...
research
04/19/2022

Network Bandwidth Variation-Adapted State Transfer for Geo-Replicated State Machines and its Application to Dynamic Replica Replacement

This paper proposes a new state transfer method for geographic state mac...
research
06/18/2019

SeeMoRe: A Fault-Tolerant Protocol for Hybrid Cloud Environments

Large scale data management systems utilize State Machine Replication to...
research
05/24/2023

Chasing the Speed of Light: Low-Latency Planetary-Scale Adaptive Byzantine Consensus

Blockchain technology has sparked renewed interest in planetary-scale By...
research
06/15/2022

PCRAFT: Capacity Planning for Dependable Stateless Services

Fault-tolerance techniques depend on replication to enhance availability...
research
04/27/2019

Collage Inference: Tolerating Stragglers in Distributed Neural Network Inference using Coding

MLaaS (ML-as-a-Service) offerings by cloud computing platforms are becom...
research
03/26/2020

State-Machine Replication for Planet-Scale Systems (Extended Version)

Online applications now routinely replicate their data at multiple sites...

Please sign up or login with your details

Forgot password? Click here to reset