Delay-optimal policies in partial fork-join systems with redundancy and random slowdowns

10/21/2019
by   Martin Zubeldia, et al.
0

We consider a large distributed service system consisting of n homogeneous servers with infinite capacity FIFO queues. Jobs arrive as a Poisson process of rate λ n/k_n (for some positive constant λ and integer k_n). Each incoming job consists of k_n identical tasks that can be executed in parallel, and that can be encoded into at least k_n "replicas" of the same size (by introducing redundancy) so that the job is considered to be completed when any k_n replicas associated with it finish their service. Moreover, we assume that servers can experience random slowdowns in their processing rate so that the service time of a replica is the product of its size and a random slowdown. First, we assume that the server slowdowns are shifted exponential and independent of the replica sizes. In this setting we show that the delay of a typical job is asymptotically minimized (as n→∞) when the number of replicas per task is a constant that only depends on the arrival rate λ, and on the expected slowdown of servers. Second, we introduce a new model for the server slowdowns in which larger tasks experience less variable slowdowns than smaller tasks. In this setting we show that, under the class of policies where all replicas start their service at the same time, the delay of a typical job is asymptotically minimized (as n→∞) when the number of replicas per task is made to depend on the actual size of the tasks being replicated, with smaller tasks being replicated more than larger tasks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/01/2017

Asymptotic response time analysis for multi-task parallel jobs

The response time of jobs with multiple parallel tasks is a critical per...
research
05/28/2021

Parallel server systems with cancel-on-completion redundancy

We consider a parallel server system with so-called cancel-on-completion...
research
07/10/2020

Stability, memory, and messaging tradeoffs in heterogeneous service systems

We consider a heterogeneous distributed service system, consisting of n ...
research
05/31/2020

Staffing for many-server systems facing non-standard arrival processes

Arrival processes to service systems often display (i) larger than antic...
research
04/05/2020

Achieving Zero Asymptotic Queueing Delay for Parallel Jobs

Zero queueing delay is highly desirable in large-scale computing systems...
research
10/01/2017

Delay Asymptotics and Bounds for Multi-Task Parallel Jobs

We study delay of jobs that consist of multiple parallel tasks, which is...
research
06/09/2019

Partial Server Pooling in Redundancy Systems

Partial sharing allows providers to possibly pool a fraction of their re...

Please sign up or login with your details

Forgot password? Click here to reset