A General "Power-of-d" Dispatching Framework for Heterogeneous Systems

12/10/2021
by   Jazeem Abdul Jaleel, et al.
0

Intelligent dispatching is crucial to obtaining low response times in large-scale systems. One common scalable dispatching paradigm is the “power-of-d,” in which the dispatcher queries d servers at random and assigns the job to a server based only on the state of the queried servers. The bulk of power-of-d policies studied in the literature assume that the system is homogeneous, meaning that all servers have the same speed; meanwhile real-world systems often exhibit server speed heterogeneity. This paper introduces a general framework for describing and analyzing heterogeneity-aware power-of-d policies. The key idea behind our framework is that dispatching policies can make use of server speed information at two decision points: when choosing which d servers to query, and when assigning a job to one of those servers. Our framework explicitly separates the dispatching policy into a querying rule and an assignment rule; we consider general families of both rule types. While the strongest assignment rules incorporate both detailed queue-length information and server speed information, these rules typically are difficult to analyze. We overcome this difficulty by focusing on heterogeneity-aware assignment rules that ignore queue length information beyond idleness status. In this setting, we analyze mean response time and formulate novel optimization problems for the joint optimization of querying and assignment. We build upon our optimized policies to develop heuristic queue length-aware dispatching policies. Our heuristic policies perform well in simulation, relative to policies that have appeared in the literature.

READ FULL TEXT
research
06/24/2020

Scalable Load Balancing in the Presence of Heterogeneous Servers

Heterogeneity is becoming increasingly ubiquitous in modern large-scale ...
research
11/16/2020

Improved Load Balancing in Large Scale Systems using Attained Service Time Reporting

Our interest lies in load balancing jobs in large scale systems consisti...
research
08/20/2020

Optimal Load Balancing in Bipartite Graphs

Applications in cloud platforms motivate the study of efficient load bal...
research
10/26/2020

Load balancing policies with server-side cancellation of replicas

Popular dispatching policies such as the join shortest queue (JSQ), join...
research
07/21/2019

Multiple Server SRPT with speed scaling is competitive

Can the popular shortest remaining processing time (SRPT) algorithm achi...
research
12/12/2021

A Restless Bandit Model for Energy-Efficient Job Assignments in Server Farms

We aim to maximize the energy efficiency, gauged as average energy cost ...
research
03/01/2023

The Power of Two Choices with Load Comparison Errors

In this paper, we analyze the effects of erroneous load comparisons on t...

Please sign up or login with your details

Forgot password? Click here to reset