On Learning the cμ Rule: Single and Multiserver Settings

02/02/2018
by   Subhashini Krishnasamy, et al.
0

We consider learning-based variants of the c μ rule -- a classic and well-studied scheduling policy -- in single and multi-server settings for multi-class queueing systems. In the single server setting, the c μ rule is known to minimize the expected holding-cost (weighted queue-lengths summed both over classes and time). We focus on the setting where the service rates μ are unknown, and are interested in the holding-cost regret -- the difference in the expected holding-costs between that induced by a learning-based rule (that learns μ) and that from the c μ rule (which has knowledge of the service rates) over any fixed time horizon. We first show that empirically learning the service rates and then scheduling using these learned values results in a regret of holding-cost that does not depend on the time horizon. The key insight that allows such a constant regret bound is that a work-conserving scheduling policy in this setting allows explore-free learning, where no penalty is incurred for exploring and learning server rates. We next consider the multi-server setting. We show that in general, the c μ rule is not stabilizing (i.e. there are stabilizable arrival and service rate parameters for which the multi-server c μ rule results in unstable queues). We then characterize sufficient conditions for stability (and also concentrations on busy periods). Using these results, we show that learning-based variants of the cμ rule again result in a constant regret (i.e. does not depend on the time horizon). This result hinges on (i) the busy period concentrations of the multi-server c μ rule, and that (ii) our learning-based rule is designed to dynamically explore server rates, but in such a manner that it eventually satisfies an explore-free condition.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/02/2018

On Learning the cμ Rule in Single and Parallel Server Networks

We consider learning-based variants of the c μ rule for scheduling in si...
research
12/21/2022

Learning-based Optimal Admission Control in a Single Server Queuing System

We consider a long-term average profit maximizing admission control prob...
research
02/20/2020

Queueing Subject To Action-Dependent Server Performance: Utilization Rate Reduction

We consider a discrete-time system comprising a first-come-first-served ...
research
02/04/2022

Learning a Discrete Set of Optimal Allocation Rules in Queueing Systems with Unknown Service Rates

We study learning-based admission control for a classical Erlang-B block...
research
09/24/2019

Stability and Instability of the MaxWeight Policy

Consider a switched queueing network with general routing among its queu...
research
03/03/2023

Queue Scheduling with Adversarial Bandit Learning

In this paper, we study scheduling of a queueing system with zero knowle...
research
08/22/2022

ABL: An original active blacklist based on a modification of the SMTP

This paper presents a novel Active Blacklist (ABL) based on a modificati...

Please sign up or login with your details

Forgot password? Click here to reset