Optimal Hyper-Scalable Load Balancing with a Strict Queue Limit

12/14/2020
by Mark van der Boor, et al.

Load balancing plays a critical role in efficiently dispatching jobs in parallel-server systems such as cloud networks and data centers. A fundamental challenge in the design of load balancing algorithms is to achieve an optimal trade-off between delay performance and implementation overhead (e.g., communication or memory usage). So far, this trade-off has been studied primarily in terms of the amount of overhead required to achieve asymptotically optimal performance, particularly vanishing delay in large-scale systems. In contrast, in the present paper, we focus on an arbitrarily sparse communication budget, possibly well below the minimum requirement for vanishing delay, referred to as the hyper-scalable operating region. Furthermore, jobs may only be admitted when a specific limit on the queue position of the job can be guaranteed. The centerpiece of our analysis is a universal upper bound for the achievable throughput of any dispatcher-driven algorithm for a given communication budget and queue limit. We also propose a specific hyper-scalable scheme that can operate at any given message rate and enforce any given queue limit, while allowing the server states to be captured via a closed product-form network, in which servers act as customers traversing various nodes. The product-form distribution is leveraged to prove that the bound is tight and that the proposed hyper-scalable scheme is throughput-optimal in a many-server regime given the communication and queue limit constraints. Extensive simulation experiments are conducted to illustrate the results.
