Achievable Stability in Redundancy Systems

08/08/2020
by   Youri Raaijmakers, et al.
0

We consider a system with N parallel servers where incoming jobs are immediately replicated to, say, d servers. Each of the N servers has its own queue and follows a FCFS discipline. As soon as the first job replica is completed, the remaining replicas are abandoned. We investigate the achievable stability region for a quite general workload model with different job types and heterogeneous servers, reflecting job-server affinity relations which may arise from data locality issues and soft compatibility constraints. Under the assumption that job types are known beforehand we show for New-Better-than-Used (NBU) distributed speed variations that no replication (d=1) gives a strictly larger stability region than replication (d>1). Strikingly, this does not depend on the underlying distribution of the intrinsic job sizes, but observing the job types is essential for this statement to hold. In case of non-observable job types we show that for New-Worse-than-Used (NWU) distributed speed variations full replication (d=N) gives a larger stability region than no replication (d=1).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/27/2020

Threshold-based rerouting and replication for resolving job-server affinity relations

We consider a system with several job types and two parallel server pool...
research
04/21/2021

Stability and Optimization of Speculative Queueing Networks

We provide a queueing-theoretic framework for job replication schemes ba...
research
07/25/2019

MDS coding is better than replication for job completion times

In a multi-server system, how can one get better performance than random...
research
12/25/2020

Synergy via Redundancy: Adaptive Replication Strategies and Fundamental Limits

The maximum possible throughput (or the rate of job completion) of a mul...
research
07/08/2022

Tackling Heterogeneous Traffic in Multi-access Systems via Erasure Coded Servers

Most data generated by modern applications is stored in the cloud, and t...
research
06/19/2020

Large-scale parallel server system with multi-component jobs

A broad class of parallel server systems is considered, for which we pro...
research
02/15/2018

On the Power-of-d-choices with Least Loaded Server Selection

Motivated by distributed schedulers that combine the power-of-d-choices ...

Please sign up or login with your details

Forgot password? Click here to reset