Learning-based Optimal Admission Control in a Single Server Queuing System

12/21/2022
by   Asaf Cohen, et al.
0

We consider a long-term average profit maximizing admission control problem in an M/M/1 queuing system with a known arrival rate but an unknown service rate. With a fixed reward collected upon service completion and a cost per unit of time enforced on customers waiting in the queue, a dispatcher decides upon arrivals whether to admit the arriving customer or not based on the full history of observations of the queue-length of the system. <cit.> showed that if all the parameters of the model are known, then it is optimal to use a static threshold policy - admit if the queue-length is less than a predetermined threshold and otherwise not. We propose a learning-based dispatching algorithm and characterize its regret with respect to optimal dispatch policies for the full information model of <cit.>. We show that the algorithm achieves an O(1) regret when all optimal thresholds with full information are non-zero, and achieves an O(ln^3+ϵ(N)) regret in the case that an optimal threshold with full information is 0 (i.e., an optimal policy is to reject all arrivals), where N is the number of arrivals and ϵ>0.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/10/2016

Optimal routing in two-queue polling systems

We consider a polling system with two queues, exhaustive service, no swi...
research
02/04/2022

Learning a Discrete Set of Optimal Allocation Rules in Queueing Systems with Unknown Service Rates

We study learning-based admission control for a classical Erlang-B block...
research
11/01/2022

Towards Maximizing Nonlinear Delay Sensitive Rewards in Queuing Systems

We consider maximizing the long-term average reward in a single server q...
research
02/02/2018

On Learning the cμ Rule in Single and Parallel Server Networks

We consider learning-based variants of the c μ rule for scheduling in si...
research
05/11/2020

Learning Algorithms for Minimizing Queue Length Regret

We consider a system consisting of a single transmitter/receiver pair an...
research
02/02/2018

On Learning the cμ Rule: Single and Multiserver Settings

We consider learning-based variants of the c μ rule -- a classic and wel...
research
01/24/2022

Structural Properties of Optimal Fidelity Selection Policies for Human-in-the-loop Queues

We study optimal fidelity selection for a human operator servicing a que...

Please sign up or login with your details

Forgot password? Click here to reset