A State Transfer Method That Adapts to Network Bandwidth Variations in Geographic State Machine Replication

10/09/2021
by   Tairi Chiba, et al.
0

We present a new state transfer method for geographic State Machine Replication (SMR) that dynamically allocates the state to be transferred among replicas according to changes in communication bandwidths. SMR is a method that improves fault tolerance by replicating a service to multiple replicas. When a replica is newly added or is recovered from a failure, the other replicas transfer the current state of the service to it. However, in geographic SMR, the communication bandwidths of replicas are different and constantly changing. Therefore, existing state transfer methods cannot fully utilize the available bandwidth, and their state transfer time becomes long. To overcome this problem, our method divides the state into multiple chunks and assigns them to replicas based on each replica's bandwidth so that the broader a replica's bandwidth is, the more chunks it transfers. The number of assigned chunks is dynamically updated based on the currently estimated bandwidth. The performance evaluation on Amazon EC2 shows that the proposed method reduces the state transfer time by up to 47

READ FULL TEXT
research
04/19/2022

Network Bandwidth Variation-Adapted State Transfer for Geo-Replicated State Machines and its Application to Dynamic Replica Replacement

This paper proposes a new state transfer method for geographic state mac...
research
10/09/2021

Evaluation and Ranking of Replica Deployments in Geographic State Machine Replication

Geographic state machine replication (SMR) is a replication method in wh...
research
03/12/2019

Communication Bandwidth for Emerging Networks: Trends and Prospects

Bandwidth is one of the essential resources for communication. Due to th...
research
05/14/2018

Early Scheduling in Parallel State Machine Replication

State machine replication is standard approach to fault tolerance. One o...
research
01/23/2021

HyCoR: Fault-Tolerant Replicated Containers Based on Checkpoint and Replay

HyCoR is a fully-operational fault tolerance mechanism for multiprocesso...
research
11/19/2020

Designing an Adaptive Bandwidth Management for Higher Education Institutions

Purpose: This study proposes an adaptive bandwidth management system whi...
research
12/16/2018

Fast and Efficient Bulk Multicasting over Dedicated Inter-Datacenter Networks

Several organizations have built multiple datacenters connected via dedi...

Please sign up or login with your details

Forgot password? Click here to reset