Data Replication for Reducing Computing Time in Distributed Systems with Stragglers

12/06/2019
by Amir Behrouzi-Far, et al.

In distributed computing systems with stragglers, various forms of redundancy can improve the average delay performance. We study the optimal replication of data in systems where the job execution time is a stochastically decreasing and convex random variable. We show that in such systems, the optimum assignment policy is the balanced replication of disjoint batches of data. Furthermore, for Exponential and Shifted-Exponential service times, we derive the optimum redundancy levels for minimizing both the expected value and the variance of the job completion time. Our analysis shows that the optimum redundancy level may not be the same for the two metrics; thus, there is a trade-off between reducing the expected value of the completion time and reducing its variance.
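
The trade-off described in the abstract can be explored numerically. The following is a minimal Monte Carlo sketch, not the paper's own model or derivation: it assumes the data is split into `n_batches` disjoint batches, each replicated on an equal number of workers, that a batch finishes when its fastest replica finishes, and that the job finishes when the slowest batch finishes. The service-time form (a shift proportional to batch size plus an exponential tail) and all names and parameter values are illustrative assumptions.

```python
import random
import statistics


def simulate_completion_time(n_workers, n_batches, shift, rate, rng):
    """Draw one job completion time under the ASSUMED model described above."""
    if n_workers % n_batches != 0:
        raise ValueError("n_batches must divide n_workers for balanced replication")
    replicas = n_workers // n_batches    # workers assigned to each batch
    batch_size = n_workers // n_batches  # assumption: n_workers data units in total
    batch_times = []
    for _ in range(n_batches):
        # Assumed shifted-exponential service time: deterministic shift that
        # grows with batch size, plus an exponential tail with the given rate.
        fastest = min(
            shift * batch_size + rng.expovariate(rate) for _ in range(replicas)
        )
        batch_times.append(fastest)  # a batch is done when its fastest replica is
    return max(batch_times)          # the job is done when every batch is


def estimate(n_workers, n_batches, shift, rate, trials=20000, seed=0):
    """Return (sample mean, sample variance) of the completion time."""
    rng = random.Random(seed)
    samples = [
        simulate_completion_time(n_workers, n_batches, shift, rate, rng)
        for _ in range(trials)
    ]
    return statistics.fmean(samples), statistics.variance(samples)


if __name__ == "__main__":
    # Sweep the redundancy level: fewer batches means more replicas per batch.
    for b in (1, 2, 3, 4, 6, 12):
        mean, var = estimate(12, b, shift=0.5, rate=1.0, trials=5000)
        print(f"batches={b:2d} replicas={12 // b:2d} mean={mean:.3f} var={var:.3f}")
```

Comparing the printed mean and variance columns across redundancy levels shows whether, under these assumed parameters, both metrics are minimized at the same number of batches.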


research
06/03/2020

Efficient Replication for Straggler Mitigation in Distributed Computing

Master-worker distributed computing systems use task replication in orde...
research
03/01/2023

Computing Redundancy in Blocking Systems: Fast Service or No Service

Redundancy in distributed computing systems reduces job completion time....
research
10/05/2020

Diversity/Parallelism Trade-off in Distributed Systems with Redundancy

As numerous machine learning and other algorithms increase in complexity...
research
08/08/2018

On the Effect of Task-to-Worker Assignment in Distributed Computing Systems with Stragglers

We study the expected completion time of some recently proposed algorith...
research
12/25/2020

Synergy via Redundancy: Adaptive Replication Strategies and Fundamental Limits

The maximum possible throughput (or the rate of job completion) of a mul...
research
04/09/2018

Predicting Dynamic Replication based on Fuzzy System in Data Grid

Data grid replication is an effective method to achieve efficient and fa...
