Learning and balancing time-varying loads in large-scale systems

12/18/2020
by   Diego Goldsztajn, et al.
0

Consider a system of n parallel server pools where tasks arrive as a time-varying Poisson process. The system aims at balancing the load by using an inner control loop with an admission threshold to assign incoming tasks to server pools; as an outer control loop, a learning scheme adjusts this threshold over time in steps of Δ units, to keep it aligned with the time-varying overall load. If the fluctuations in the normalized load are smaller than Δ, then we prove that the threshold settles for all large enough n and balances the load when Δ = 1. Our model captures a tradeoff between optimality and stability, since for higher Δ the degree of balance decreases, but the threshold remains constant under larger load fluctuations. The analysis of this model is mathematically challenging, particularly since the learning scheme relies on subtle variations in the occupancy state of the system which vanish on the fluid scale; the methodology developed in this paper overcomes this hurdle by leveraging the tractability of the specific system dynamics. Strong approximations are used to prove certain dynamical properties which are then used to characterize the behavior of the system, without relying on a traditional fluid-limit analysis.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/29/2020

Self-Learning Threshold-Based Load Balancing

We consider a large-scale service system where incoming tasks have to be...
research
12/16/2021

Utility maximizing load balancing policies

Consider a service system where incoming tasks are instantaneously dispa...
research
12/22/2017

Scalable Load Balancing in Networked Systems: Universality Properties and Stochastic Coupling Methods

We present an overview of scalable load balancing algorithms which provi...
research
06/04/2017

Load Balancing in Large-Scale Systems with Multiple Dispatchers

Load balancing algorithms play a crucial role in delivering robust appli...
research
03/24/2017

Optimal Service Elasticity in Large-Scale Distributed Systems

A fundamental challenge in large-scale cloud networks and data centers i...
research
05/27/2020

Parallel Load Balancing on Constrained Client-Server Topologies

We study parallel Load Balancing protocols for a client-server distribut...
research
03/03/2022

Asymptotic Optimality of Speed-Aware JSQ for Heterogeneous Systems

The Join-the-Shortest-Queue (JSQ) load-balancing scheme is known to mini...

Please sign up or login with your details

Forgot password? Click here to reset