Optimal Load Balancing in Bipartite Graphs

08/20/2020
by Wentao Weng, et al.

Applications in cloud platforms motivate the study of efficient load balancing under job-server constraints and server heterogeneity. In this paper, we study load balancing on a bipartite graph where left nodes correspond to job types and right nodes correspond to servers, with each edge indicating that a job type can be served by a server. Edges thus represent locality constraints: each job can be served only at servers that contain certain data and/or machine learning (ML) models. Servers in this system can have heterogeneous service rates. In this setting, we investigate the performance of two policies, Join-the-Fastest-of-the-Shortest-Queue (JFSQ) and Join-the-Fastest-of-the-Idle-Queue (JFIQ), which are simple variants of Join-the-Shortest-Queue and Join-the-Idle-Queue in which ties are broken in favor of the fastest servers. Under a "well-connected" graph condition, we show that JFSQ and JFIQ are asymptotically optimal in mean response time as the number of servers goes to infinity. In addition to asymptotic optimality, we obtain upper bounds on the mean response time for finite-size systems. We further show that the well-connectedness condition can be satisfied by a random bipartite graph construction with relatively sparse connectivity.
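
As an illustration of the two dispatching rules described above, the following is a minimal Python sketch, not the authors' implementation. The function names, the `compat` adjacency map (job type to compatible servers), and the `queue_len` and `rate` lists are all hypothetical names chosen for this example. The abstract does not say what JFIQ does when no compatible server is idle; the uniform random fallback below is an assumption borrowed from the usual Join-the-Idle-Queue convention.

```python
import random
from typing import Dict, List, Sequence


def jfsq_dispatch(job_type: str,
                  compat: Dict[str, List[int]],
                  queue_len: Sequence[int],
                  rate: Sequence[float]) -> int:
    """Join-the-Fastest-of-the-Shortest-Queue (illustrative sketch).

    Among the servers adjacent to `job_type` in the bipartite graph,
    pick one with the shortest queue; break ties by highest service rate.
    """
    servers = compat[job_type]
    if not servers:
        raise ValueError(f"job type {job_type!r} has no compatible server")
    # Sort key: shorter queue first, then faster server (negated rate).
    return min(servers, key=lambda s: (queue_len[s], -rate[s]))


def jfiq_dispatch(job_type: str,
                  compat: Dict[str, List[int]],
                  queue_len: Sequence[int],
                  rate: Sequence[float],
                  rng: random.Random) -> int:
    """Join-the-Fastest-of-the-Idle-Queue (illustrative sketch).

    If any compatible server is idle, pick the fastest idle one.
    ASSUMPTION: otherwise fall back to a uniformly random compatible
    server; the source text does not specify this case.
    """
    servers = compat[job_type]
    if not servers:
        raise ValueError(f"job type {job_type!r} has no compatible server")
    idle = [s for s in servers if queue_len[s] == 0]
    if idle:
        return max(idle, key=lambda s: rate[s])
    return rng.choice(servers)


if __name__ == "__main__":
    # Hypothetical toy instance: two job types, three servers.
    compat = {"a": [0, 1], "b": [1, 2]}
    queue_len = [1, 1, 0]
    rate = [1.0, 2.0, 0.5]
    rng = random.Random(0)
    print(jfsq_dispatch("a", compat, queue_len, rate))
    print(jfiq_dispatch("b", compat, queue_len, rate, rng))
```

In this toy instance, a job of type "a" sees two compatible servers with equal queue lengths, so JFSQ sends it to the faster one; a job of type "b" sees an idle server and both policies send it there. Note that JFIQ only needs to know which servers are idle, whereas JFSQ needs the queue lengths of all compatible servers.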
