Exploiting Data Locality to Improve Performance of Heterogeneous Server Clusters

11/29/2022
by   Zhisheng Zhao, et al.
0

We consider load balancing in large-scale heterogeneous server systems in the presence of data locality that imposes constraints on which tasks can be assigned to which servers. The constraints are naturally captured by a bipartite graph between the servers and the dispatchers handling assignments of various arrival flows. When a task arrives, the corresponding dispatcher assigns it to a server with the shortest queue among d≥ 2 randomly selected servers obeying the above constraints. Server processing speeds are heterogeneous and they depend on the server-type. For a broad class of bipartite graphs, we characterize the limit of the appropriately scaled occupancy process, both on the process-level and in steady state, as the system size becomes large. Using such a characterization, we show that data locality constraints can be used to significantly improve the performance of heterogeneous systems. This is in stark contrast to either heterogeneous servers in a full flexible system or data locality constraints in systems with homogeneous servers, both of which have been observed to degrade the system performance. Extensive numerical experiments corroborate the theoretical results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/17/2020

Load Balancing Under Strict Compatibility Constraints

We study large-scale systems operating under the JSQ(d) policy in the pr...
research
08/20/2020

Optimal Load Balancing in Bipartite Graphs

Applications in cloud platforms motivate the study of efficient load bal...
research
04/05/2018

Dynamic Load Balancing with Tokens

Efficiently exploiting the resources of data centers is a complex task t...
research
06/01/2023

Optimal Rate-Matrix Pruning For Large-Scale Heterogeneous Systems

We present an analysis of large-scale load balancing systems, where the ...
research
09/02/2021

Load Balancing in Heterogeneous Server Clusters: Insights From a Product-Form Queueing Model

Efficiently exploiting servers in data centers requires performance anal...
research
06/24/2020

Scalable Load Balancing in the Presence of Heterogeneous Servers

Heterogeneity is becoming increasingly ubiquitous in modern large-scale ...
research
11/08/2017

Performance of Balanced Fairness in Resource Pools: A Recursive Approach

Understanding the performance of a pool of servers is crucial for proper...

Please sign up or login with your details

Forgot password? Click here to reset