Learning Mean-Field Control for Delayed Information Load Balancing in Large Queuing Systems

08/09/2022
by   Anam Tahir, et al.
0

Recent years have seen a great increase in the capacity and parallel processing power of data centers and cloud services. To fully utilize the said distributed systems, optimal load balancing for parallel queuing architectures must be realized. Existing state-of-the-art solutions fail to consider the effect of communication delays on the behaviour of very large systems with many clients. In this work, we consider a multi-agent load balancing system, with delayed information, consisting of many clients (load balancers) and many parallel queues. In order to obtain a tractable solution, we model this system as a mean-field control problem with enlarged state-action space in discrete time through exact discretization. Subsequently, we apply policy gradient reinforcement learning algorithms to find an optimal load balancing solution. Here, the discrete-time system model incorporates a synchronization delay under which the queue state information is synchronously broadcasted and updated at all clients. We then provide theoretical performance guarantees for our methodology in large systems. Finally, using experiments, we prove that our approach is not only scalable but also shows good performance when compared to the state-of-the-art power-of-d variant of the Join-the-Shortest-Queue (JSQ) and other policies in the presence of synchronization delays.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/18/2017

Network Load Balancing Methods: Experimental Comparisons and Improvement

Load balancing algorithms play critical roles in systems where the workl...
research
04/30/2021

Discrete-Time Mean Field Control with Environment States

Multi-agent reinforcement learning methods have shown remarkable potenti...
research
12/12/2016

Geographical Load Balancing across Green Datacenters

"Geographic Load Balancing" is a strategy for reducing the energy cost o...
research
12/22/2017

Scalable Load Balancing in Networked Systems: Universality Properties and Stochastic Coupling Methods

We present an overview of scalable load balancing algorithms which provi...
research
06/14/2018

Scalable load balancing in networked systems: A survey of recent advances

The basic load balancing scenario involves a single dispatcher where tas...
research
08/03/2020

Distributed Dispatching in the Parallel Server Model

With the rapid increase in the size and volume of cloud services and dat...
research
06/06/2022

CARE: Resource Allocation Using Sparse Communication

We propose a new framework for studying effective resource allocation in...

Please sign up or login with your details

Forgot password? Click here to reset