Threshold-based rerouting and replication for resolving job-server affinity relations

05/27/2020
by Youri Raaijmakers, et al.

We consider a system with several job types and two parallel server pools. Servers are homogeneous within each pool but possibly heterogeneous across pools, in the sense that the service speed of a job may depend on both its type and the server pool. Jobs are assigned to a server pool immediately upon arrival. The assignment could be based on (partial) knowledge of the job type, but such knowledge might not be available. Information about the job type can, however, be obtained while the job is in service: as the service progresses, the likelihood that the service speed of this job type is low increases, creating an incentive to execute the job on different, possibly faster, server(s). Two policies are considered: reroute the job to the other server pool, or replicate it there. We determine the effective load per server under both the rerouting and the replication policy, for completely unknown as well as partly known job types. We also examine the impact of these policies on the stability bound and find that uncertainty in job types may significantly degrade performance. For (highly) unbalanced service speeds, full replication achieves the largest stability bound, while for (nearly) balanced service speeds, no replication maximizes the stability bound. Finally, we discuss how threshold-based policies can help improve the expected latency for completely or partly unknown job types.
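
The intuition that elapsed service time carries information about the job type can be illustrated with a small sketch. It is not the model from the paper: it assumes exactly two job types on the current pool, exponentially distributed service times, and a known prior probability of the slow type. The function names and the parameters `prior_slow`, `rate_slow`, `rate_fast`, and `belief_level` are hypothetical and chosen only for illustration. Under these assumptions, the probability that a job still in service is of the slow type grows with elapsed time, and a threshold on that probability translates into a threshold on elapsed service time.

```python
import math


def posterior_slow(t, prior_slow, rate_slow, rate_fast):
    """P(job is the slow type | job is still in service after time t).

    Assumes exponential service times (an illustrative assumption),
    with completion rate rate_slow for the slow type and rate_fast
    for the fast type on the current pool.
    """
    # Weight of each type = prior probability * probability of surviving to t.
    w_slow = prior_slow * math.exp(-rate_slow * t)
    w_fast = (1.0 - prior_slow) * math.exp(-rate_fast * t)
    return w_slow / (w_slow + w_fast)


def reroute_threshold(prior_slow, rate_slow, rate_fast, belief_level):
    """Elapsed service time at which the posterior first reaches belief_level.

    Obtained by equating posterior odds with target odds:
        prior_odds * exp((rate_fast - rate_slow) * t) = target_odds
    """
    if not (0.0 < prior_slow < 1.0 and 0.0 < belief_level < 1.0):
        raise ValueError("probabilities must lie strictly between 0 and 1")
    if not (0.0 < rate_slow < rate_fast):
        raise ValueError("expected 0 < rate_slow < rate_fast")
    if belief_level <= prior_slow:
        return 0.0  # the prior alone already meets the belief level
    prior_odds = prior_slow / (1.0 - prior_slow)
    target_odds = belief_level / (1.0 - belief_level)
    return math.log(target_odds / prior_odds) / (rate_fast - rate_slow)


if __name__ == "__main__":
    # Hypothetical numbers: half the jobs are slow, fast jobs finish 3x faster.
    tau = reroute_threshold(prior_slow=0.5, rate_slow=1.0, rate_fast=3.0,
                            belief_level=0.9)
    print(f"act (reroute or replicate) after {tau:.4f} time units in service")
```

In this sketch the threshold shrinks as the gap between the two rates widens, which matches the intuition that strongly unbalanced service speeds reveal the job type sooner. What happens once the threshold is reached (rerouting or replication) is left out here.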

