Optimal Rate-Matrix Pruning For Large-Scale Heterogeneous Systems

06/01/2023
by   Zhisheng Zhao, et al.
0

We present an analysis of large-scale load balancing systems, where the processing time distribution of tasks depends on both the task and server types. Our study focuses on the asymptotic regime, where the number of servers and task types tend to infinity in proportion. In heterogeneous environments, commonly used load balancing policies such as Join Fastest Idle Queue and Join Fastest Shortest Queue exhibit poor performance and even shrink the stability region. Interestingly, prior to this work, finding a scalable policy with a provable performance guarantee in this setup remained an open question. To address this gap, we propose and analyze two asymptotically delay-optimal dynamic load balancing policies. The first policy efficiently reserves the processing capacity of each server for “good" tasks and routes tasks using the vanilla Join Idle Queue policy. The second policy, called the speed-priority policy, significantly increases the likelihood of assigning tasks to the respective “good" servers capable of processing them at high speeds. By leveraging a framework inspired by the graphon literature and employing the mean-field method and stochastic coupling arguments, we demonstrate that both policies achieve asymptotic zero queuing. Specifically, as the system scales, the probability of a typical task being assigned to an idle server approaches 1.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/22/2017

Scalable Load Balancing in Networked Systems: Universality Properties and Stochastic Coupling Methods

We present an overview of scalable load balancing algorithms which provi...
research
06/24/2020

Scalable Load Balancing in the Presence of Heterogeneous Servers

Heterogeneity is becoming increasingly ubiquitous in modern large-scale ...
research
06/14/2018

Scalable load balancing in networked systems: A survey of recent advances

The basic load balancing scenario involves a single dispatcher where tas...
research
12/16/2021

Utility maximizing load balancing policies

Consider a service system where incoming tasks are instantaneously dispa...
research
10/17/2018

Load balancing with heterogeneous schedulers

Load balancing is a common approach in web server farms or inventory rou...
research
03/20/2018

Join-Idle-Queue with Service Elasticity: Large-Scale Asymptotics of a Non-monotone System

We consider the model of a token-based joint auto-scaling and load balan...
research
11/29/2022

Exploiting Data Locality to Improve Performance of Heterogeneous Server Clusters

We consider load balancing in large-scale heterogeneous server systems i...

Please sign up or login with your details

Forgot password? Click here to reset