Stability, memory, and messaging tradeoffs in heterogeneous service systems

07/10/2020
by   David Gamarnik, et al.
0

We consider a heterogeneous distributed service system, consisting of n servers with unknown and possibly different processing rates. Jobs with unit mean and independent processing times arrive as a renewal process of rate λ n, with 0<λ<1, to the system. Incoming jobs are immediately dispatched to one of several queues associated with the n servers. We assume that the dispatching decisions are made by a central dispatcher endowed with a finite memory, and with the ability to exchange messages with the servers. We study the fundamental resource requirements (memory bits and message exchange rate) in order for a dispatching policy to be maximally stable, i.e., stable whenever the processing rates are such that the arrival rate is less than the total available processing rate. First, for the case of Poisson arrivals and exponential service times, we present a policy that is maximally stable while using a positive (but arbitrarily small) message rate, and log_2(n) bits of memory. Second, we show that within a certain broad class of policies, a dispatching policy that exchanges o(n^2) messages per unit of time, and with o(log(n)) bits of memory, cannot be maximally stable. Thus, as long as the message rate is not too excessive, a logarithmic memory is necessary and sufficient for maximal stability.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/28/2020

A Theory of Auto-Scaling for Resource Reservation in Cloud Services

We consider a distributed server system consisting of a large number of ...
research
10/21/2019

Delay-optimal policies in partial fork-join systems with redundancy and random slowdowns

We consider a large distributed service system consisting of n homogeneo...
research
04/30/2020

A Lower Bound on the stability region of Redundancy-d with FIFO service discipline

Redundancy-d (R(d)) is a load balancing method used to route incoming jo...
research
10/04/2019

System Stability Under Adversarial Injection of Dependent Tasks

In this work, we consider a computational model of a distributed system ...
research
04/26/2023

Performance of the Gittins Policy in the G/G/1 and G/G/k, With and Without Setup Times

How should we schedule jobs to minimize mean queue length? In the preemp...
research
10/19/2021

Stability analysis of two-class retrial systems with constant retrial rates and general service times

We establish stability criterion for a two-class retrial system with Poi...
research
11/20/2022

Best-Effort Communication Improves Performance and Scales Robustly on Conventional Hardware

Here, we test the performance and scalability of fully-asynchronous, bes...

Please sign up or login with your details

Forgot password? Click here to reset