Improved Load Balancing in Large Scale Systems using Attained Service Time Reporting

11/16/2020
by   Tim Hellemans, et al.
0

Our interest lies in load balancing jobs in large scale systems consisting of multiple dispatchers and FCFS servers. In the absence of any information on job sizes, dispatchers typically use queue length information reported by the servers to assign incoming jobs. When job sizes are highly variable, using only queue length information is clearly suboptimal and performance can be improved if some indication can be provided to the dispatcher about the size of an ongoing job. In a FCFS server measuring the attained service time of the ongoing job is easy and servers can therefore report this attained service time together with the queue length when queried by a dispatcher. In this paper we propose and analyse a variety of load balancing policies that exploit both the queue length and attained service time to assign jobs, as well as policies for which only the attained service time of the job in service is used. We present a unified analysis for all these policies in a large scale system under the usual asymptotic independence assumptions. The accuracy of the proposed analysis is illustrated using simulation. We present extensive numerical experiments which clearly indicate that a significant improvement in waiting (and thus also in response) time may be achieved by using the attained service time information on top of the queue length of a server. Moreover, the policies which do not make use of the queue length still provide an improved waiting time for moderately loaded systems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/11/2022

Performance of Load Balancers with Bounded Maximum Queue Length in case of Non-Exponential Job Sizes

In large-scale distributed systems, balancing the load in an efficient w...
research
10/26/2020

Load balancing policies with server-side cancellation of replicas

Popular dispatching policies such as the join shortest queue (JSQ), join...
research
04/03/2023

Load Balancing with Job-Size Testing: Performance Improvement or Degradation?

In the context of decision making under explorable uncertainty, scheduli...
research
12/10/2021

A General "Power-of-d" Dispatching Framework for Heterogeneous Systems

Intelligent dispatching is crucial to obtaining low response times in la...
research
05/23/2019

The Supermarket Model with Known and Predicted Service Times

The supermarket model typically refers to a system with a large number o...
research
10/18/2021

Infinite Servers Queue Systems Busy Period Time Length Distribution and Parameters Study through Computational Simulation

A FORTRAN program to simulate the operation of infinite servers queues i...
research
12/13/2021

Scheduling Servers with Stochastic Bilinear Rewards

In this paper we study a multi-class, multi-server queueing system with ...

Please sign up or login with your details

Forgot password? Click here to reset