MDS coding is better than replication for job completion times

07/25/2019
by Ken Duffy, et al.

In a multi-server system, how can one get better performance than random assignment of jobs to servers if queue states cannot be queried by the dispatcher? A replication strategy has recently been proposed in which d copies of each arriving job are sent to servers chosen at random. The job's completion time is the first time at which the service of any of its copies is complete. On completion, redundant copies of the job are removed from other queues so as not to overburden the system. For digital jobs, where the objects to be served can be algebraically manipulated, and for servers whose output is a linear function of their input, we consider an alternative strategy: Maximum Distance Separable (MDS) codes. For every batch of n digital jobs that arrive, n+m linear combinations are created over the reals or a large finite field, and each coded job is sent to a random server. The batch completion time is the first time at which any n of the n+m coded jobs have been served, as the evaluations of the n original jobs can then be recovered by Gaussian elimination. If redundant jobs can be removed from queues on batch completion, we establish that, to obtain the improved response-time performance of sending d copies of each of n jobs via the replication strategy, it suffices with the MDS methodology to send n+d jobs. That is, while replication is multiplicative, MDS is linear. If redundant jobs cannot be removed from queues on batch completion, the stability regions of the two strategies are distinct and the performance with MDS codes is better still.
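The encode, serve, and decode steps described in the abstract can be illustrated with a small sketch. It is not the authors' implementation. It assumes a Vandermonde encoding matrix with nodes 1, ..., n+m (one standard way to obtain the MDS property, since any n of its rows are invertible), exact rational arithmetic in place of the reals, and a hypothetical linear server `server` with a made-up matrix `W`. All function names are illustrative.

```python
from fractions import Fraction
from itertools import combinations

def encoding_matrix(n, m):
    # Assumed choice: (n+m) x n Vandermonde matrix with distinct nodes 1..n+m.
    # Any n of its rows form an invertible matrix, which gives the MDS property.
    return [[Fraction(x) ** j for j in range(n)] for x in range(1, n + m + 1)]

def encode(jobs, m):
    # Turn a batch of n jobs (equal-length vectors) into n+m linear combinations.
    n, width = len(jobs), len(jobs[0])
    G = encoding_matrix(n, m)
    return [[sum(G[i][k] * jobs[k][c] for k in range(n)) for c in range(width)]
            for i in range(n + m)]

def server(x):
    # Hypothetical linear server: returns W x for an illustrative matrix W.
    W = [[2, 0, 1], [1, -1, 3]]
    return [sum(Fraction(w) * v for w, v in zip(row, x)) for row in W]

def solve(A, B):
    # Gauss-Jordan elimination over the rationals: solves A X = B for X.
    n = len(A)
    M = [[Fraction(v) for v in A[i]] + [Fraction(v) for v in B[i]] for i in range(n)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if M[r][col] != 0)
        M[col], M[pivot] = M[pivot], M[col]
        pv = M[col][col]
        M[col] = [v / pv for v in M[col]]
        for r in range(n):
            if r != col and M[r][col] != 0:
                f = M[r][col]
                M[r] = [a - f * b for a, b in zip(M[r], M[col])]
    return [row[n:] for row in M]

def recover(done, done_outputs, n, m):
    # done: indices of the n coded jobs served first; done_outputs: their results.
    G = encoding_matrix(n, m)
    return solve([G[i] for i in done], done_outputs)

n, m = 3, 2
jobs = [[1, 2, 3], [4, 5, 6], [0, -1, 2]]      # batch of n original jobs
coded = encode(jobs, m)                         # n+m coded jobs sent to servers
outputs = [server(c) for c in coded]            # what each server would return
expected = [server(j) for j in jobs]            # evaluations we want to recover

# Whichever n coded jobs finish first, the original evaluations are recovered.
decodable = all(
    recover(list(s), [outputs[i] for i in s], n, m) == expected
    for s in combinations(range(n + m), n)
)
print("every n-subset decodes:", decodable)
```

With n = 3 and d = 2, replication would dispatch d·n = 6 jobs, whereas the coded batch here dispatches n + d = 5. The sketch covers only the algebra of recovery; it does not model queueing, random server selection, or removal of redundant jobs.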


