Join-Idle-Queue with Service Elasticity: Large-Scale Asymptotics of a Non-monotone System

03/20/2018
by   Debankur Mukherjee, et al.

We consider the model of a token-based joint auto-scaling and load balancing strategy, proposed in a recent paper by Mukherjee, Dhara, Borst, and van Leeuwaarden (SIGMETRICS '17, arXiv:1703.08373), which offers an efficient, scalable implementation and yet achieves asymptotically optimal steady-state delay performance and energy consumption as the number of servers N→∞. In that work, the asymptotic results were obtained under the assumption that the queues have fixed-size finite buffers, so the fundamental question of whether the proposed scheme is stable with infinite buffers was left open. In this paper, we address this stability question. Stability of the system under the usual subcritical load assumption is not automatic; moreover, it may not even hold for all N. The key challenge is that the process lacks monotonicity, which has been the primary tool for establishing stability in load balancing models. We develop a novel method to prove that the subcritically loaded system is stable for large enough N, and we establish convergence of the steady-state distributions to the optimal one as N→∞. The method goes beyond state-of-the-art techniques: it uses an induction-based idea and a "weak monotonicity" property of the model. This technique is of independent interest and may have broader applicability.

research
03/24/2017

Optimal Service Elasticity in Large-Scale Distributed Systems

A fundamental challenge in large-scale cloud networks and data centers i...
research
06/14/2018

Scalable load balancing in networked systems: A survey of recent advances

The basic load balancing scenario involves a single dispatcher where tas...
research
06/01/2023

Optimal Rate-Matrix Pruning For Large-Scale Heterogeneous Systems

We present an analysis of large-scale load balancing systems, where the ...
research
04/04/2022

Asynchronous Load Balancing and Auto-scaling: Mean-Field Limit and Optimal Design

We introduce a Markovian framework for load balancing where classical al...
research
03/03/2022

Asymptotic Optimality of Speed-Aware JSQ for Heterogeneous Systems

The Join-the-Shortest-Queue (JSQ) load-balancing scheme is known to mini...
research
02/16/2022

Large-System Insensitivity of Zero-Waiting Load Balancing Algorithms

This paper studies the sensitivity (or insensitivity) of a class of load...
research
02/07/2019

A Random Access G-Network: Stability, Stable Throughput, and Queueing Analysis

The effect of signals on stability, throughput region, and delay in a tw...