On the Power-of-d-choices with Least Loaded Server Selection

02/15/2018
by   Tim Hellemans, et al.
0

Motivated by distributed schedulers that combine the power-of-d-choices with late binding and systems that use replication with cancellation-on-start, we study the performance of the LL(d) policy which assigns a job to a server that currently has the least workload among d randomly selected servers in large-scale homogeneous clusters. We consider general service time distributions and propose a partial integro-differential equation to describe the evolution of the system. This equation relies on the earlier proven ansatz for LL(d) which asserts that the workload distribution of any finite set of queues becomes independent of one another as the number of servers tends to infinity. Based on this equation we propose a fixed point iteration for the limiting workload distribution and study its convergence. For exponential job sizes we present a simple closed form expression for the limiting workload distribution that is valid for any work-conserving service discipline as well as for the limiting response time distribution in case of first-come-first-served scheduling. We further show that for phase-type distributed job sizes the limiting workload and response time distribution can be expressed via the unique solution of a simple set of ordinary differential equations. Numerical and analytical results that compare response time of the classic power-of-d-choices algorithm and the LL(d) policy are also presented and the accuracy of the limiting response time distribution for finite systems is illustrated using simulation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/02/2020

Heavy Traffic Analysis of the Mean Response Time for Load Balancing Policies in the Mean Field Regime

Mean field models are a popular tool used to analyse load balancing poli...
research
11/13/2018

Global attraction of ODE-based mean field models with hyperexponential job sizes

Mean field modeling is a popular approach to assess the performance of l...
research
01/18/2019

Derandomized Load Balancing using Random Walks on Expander Graphs

In a computing center with a huge amount of machines, when a job arrives...
research
06/11/2020

Performance Analysis of Modified SRPT in Multiple-Processor Multitask Scheduling

In this paper we study the multiple-processor multitask scheduling probl...
research
11/14/2019

Optimal Server Selection for Straggler Mitigation

The performance of large-scale distributed compute systems is adversely ...
research
08/08/2020

Achievable Stability in Redundancy Systems

We consider a system with N parallel servers where incoming jobs are imm...
research
06/27/2020

Queues with Small Advice

Motivated by recent work on scheduling with predicted job sizes, we cons...

Please sign up or login with your details

Forgot password? Click here to reset