Scalable Load Balancing in the Presence of Heterogeneous Servers

06/24/2020
by   Kristen Gardner, et al.
0

Heterogeneity is becoming increasingly ubiquitous in modern large-scale computer systems. Developing good load balancing policies for systems whose resources have varying speeds is crucial in achieving low response times. Indeed, how best to dispatch jobs to servers is a classical and well-studied problem in the queueing literature. Yet the bulk of existing work on large-scale systems assumes homogeneous servers; unfortunately, policies that perform well in the homogeneous setting can cause unacceptably poor performance—or even instability—in heterogeneous systems. We adapt the "power-of-d" versions of both the Join-the-Idle-Queue and Join-the-Shortest-Queue policies to design two corresponding families of heterogeneity-aware dispatching policies, each of which is parameterized by a pair of routing probabilities. Unlike their heterogeneity-unaware counterparts, our policies use server speed information both when choosing which servers to query and when probabilistically deciding where (among the queried servers) to dispatch jobs. Both of our policy families are analytically tractable: our mean response time and queue length distribution analyses are exact as the number of servers approaches infinity, under standard assumptions. Furthermore, our policy families achieve maximal stability and outperform well-known dispatching rules—including heterogeneity-aware policies such as Shortest-Expected-Delay—with respect to mean response time.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/10/2021

A General "Power-of-d" Dispatching Framework for Heterogeneous Systems

Intelligent dispatching is crucial to obtaining low response times in la...
research
03/03/2022

Asymptotic Optimality of Speed-Aware JSQ for Heterogeneous Systems

The Join-the-Shortest-Queue (JSQ) load-balancing scheme is known to mini...
research
06/01/2023

Optimal Rate-Matrix Pruning For Large-Scale Heterogeneous Systems

We present an analysis of large-scale load balancing systems, where the ...
research
10/26/2020

Load balancing policies with server-side cancellation of replicas

Popular dispatching policies such as the join shortest queue (JSQ), join...
research
04/02/2020

Heavy Traffic Analysis of the Mean Response Time for Load Balancing Policies in the Mean Field Regime

Mean field models are a popular tool used to analyse load balancing poli...
research
03/04/2020

LSQ: Load Balancing in Large-Scale Heterogeneous Systems with Multiple Dispatchers

Nowadays, the efficiency and even the feasibility of traditional load-ba...
research
11/29/2022

Exploiting Data Locality to Improve Performance of Heterogeneous Server Clusters

We consider load balancing in large-scale heterogeneous server systems i...

Please sign up or login with your details

Forgot password? Click here to reset