Power-of-d-Choices with Memory: Fluid Limit and Optimality

by Jonatan Anselmi, et al.
Université de Bordeaux

In multi-server distributed queueing systems, the access of stochastically arriving jobs to resources is often regulated by a dispatcher, also known as a load balancer. A fundamental problem is to design a load balancing algorithm that minimizes the delays experienced by jobs. During the last two decades, the power-of-d-choice algorithm, which dispatches each job to the least loaded of d servers sampled at random when the job arrives, has emerged as a breakthrough in the foundations of this area due to its versatility and appealing asymptotic properties. In this paper, we consider the power-of-d-choice algorithm with the addition of a local memory that keeps track of the latest observations collected over time on the sampled servers. Each job is then sent to a server with the lowest observation. We show that this algorithm is asymptotically optimal in the sense that the load balancer can always assign each job to an idle server in the large-server limit. This holds true if and only if the system load λ is less than 1-1/d. If this condition is not satisfied, we show that queue lengths are bounded by j^*+1, where j^* ∈ ℕ is given by the solution of a polynomial equation. This is in contrast with the classic version of the power-of-d-choice algorithm, where queue lengths are unbounded. Our upper bound on the size of the most loaded server, j^*+1, is tight and increases slowly when λ approaches its critical value from below. For instance, when λ = 0.995 and d = 2 (respectively, d = 3), we find that no server will contain more than 5 (respectively, 3) jobs in equilibrium. Our results quantify and highlight the importance of using memory as a means to enhance performance in randomized load balancing.
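The dispatching rule described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function name `dispatch`, the dictionary used as the memory, the sampling of d distinct servers, the tie-breaking by lowest index, and the dispatcher incrementing its own record for the chosen server are all assumptions made for illustration. Job departures are not modeled.

```python
import random

def dispatch(queues, memory, d, rng):
    """Route one arriving job and return the index of the chosen server."""
    # Sample d distinct servers uniformly at random and store their
    # current queue lengths in the dispatcher's local memory.
    for s in rng.sample(range(len(queues)), d):
        memory[s] = queues[s]
    # Send the job to a server with the lowest stored observation.
    # Assumption: ties are broken by the lowest server index.
    target = min(memory, key=lambda s: (memory[s], s))
    queues[target] += 1
    # Assumption: the dispatcher also updates its own record for that server.
    memory[target] += 1
    return target

rng = random.Random(0)   # fixed seed so the run is reproducible
queues = [0] * 10        # true queue lengths of 10 servers
memory = {}              # server index -> last observed queue length
for _ in range(5):
    dispatch(queues, memory, d=2, rng=rng)
```

Unlike the memoryless power-of-d-choice rule, the minimum here is taken over every server observed so far, not only over the d servers sampled for the current job.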



