Data Replication for Reducing Computing Time in Distributed Systems with Stragglers

12/06/2019
by   Amir Behrouzi-Far, et al.

In distributed computing systems with stragglers, various forms of redundancy can improve the average delay performance. We study the optimal replication of data in systems where the job execution time is a stochastically decreasing and convex random variable. We show that in such systems, the optimum assignment policy is the balanced replication of disjoint batches of data. Furthermore, for Exponential and Shifted-Exponential service times, we derive the optimum redundancy levels for minimizing both the expected value and the variance of the job completion time. Our analysis shows that the optimum redundancy level may not be the same for the two metrics, so there is a trade-off between reducing the expected value of the completion time and reducing its variance.
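The trade-off described above can be explored numerically. The following is a minimal Monte Carlo sketch, not the paper's derivation. It rests on assumptions that the abstract does not state: the data consists of N units for N workers, it is split into B disjoint batches of N/B units each, every batch is replicated on N/B workers (balanced replication), a worker's time on a batch is Shifted-Exponential with shift `delta * batch_size` and rate `mu / batch_size`, and the job finishes when every batch has been completed by at least one of its replicas. The names `sample_completion_time`, `estimate_mean_and_variance`, `mu`, and `delta` are hypothetical and chosen for illustration only.

```python
import random
import statistics


def sample_completion_time(n_workers, n_batches, mu, delta, rng):
    """Draw one job completion time under balanced replication.

    Assumed model (illustrative, not taken from the paper):
    - the data has n_workers units, split into n_batches disjoint batches;
    - each batch is replicated on n_workers // n_batches workers;
    - a worker's time on a batch is delta * batch_size + Exp(mu / batch_size).
    """
    if n_batches < 1 or n_workers % n_batches != 0:
        raise ValueError("n_batches must be a positive divisor of n_workers")
    replicas = n_workers // n_batches
    batch_size = n_workers / n_batches
    shift = delta * batch_size
    rate = mu / batch_size
    # A batch is done when its fastest replica finishes;
    # the job is done when its slowest batch is done.
    return max(
        min(shift + rng.expovariate(rate) for _ in range(replicas))
        for _ in range(n_batches)
    )


def estimate_mean_and_variance(n_workers, n_batches, mu=1.0, delta=0.0,
                               trials=20000, seed=0):
    """Estimate the mean and variance of the completion time by simulation."""
    rng = random.Random(seed)
    samples = [
        sample_completion_time(n_workers, n_batches, mu, delta, rng)
        for _ in range(trials)
    ]
    return statistics.fmean(samples), statistics.variance(samples)


if __name__ == "__main__":
    n_workers = 12
    for n_batches in (1, 2, 3, 4, 6, 12):
        mean, var = estimate_mean_and_variance(n_workers, n_batches,
                                               mu=1.0, delta=0.2)
        print(f"B={n_batches:2d}  mean={mean:.3f}  variance={var:.3f}")
```

Sweeping `n_batches` over the divisors of `n_workers` for a few values of `delta` gives a rough picture of how the batch count that minimizes the estimated mean can differ from the one that minimizes the estimated variance under this assumed model.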

research
06/03/2020

Efficient Replication for Straggler Mitigation in Distributed Computing

Master-worker distributed computing systems use task replication in orde...
research
03/01/2023

Computing Redundancy in Blocking Systems: Fast Service or No Service

Redundancy in distributed computing systems reduces job completion time....
research
10/05/2020

Diversity/Parallelism Trade-off in Distributed Systems with Redundancy

As numerous machine learning and other algorithms increase in complexity...
research
08/08/2018

On the Effect of Task-to-Worker Assignment in Distributed Computing Systems with Stragglers

We study the expected completion time of some recently proposed algorith...
research
12/25/2020

Synergy via Redundancy: Adaptive Replication Strategies and Fundamental Limits

The maximum possible throughput (or the rate of job completion) of a mul...
research
10/02/2017

Effective Straggler Mitigation: Which Clones Should Attack and When?

Redundancy for straggler mitigation, originally in data download and mor...
