Learning Algorithms for Minimizing Queue Length Regret

05/11/2020
by Thomas Stahlbuhk, et al.

We consider a system consisting of a single transmitter/receiver pair and N channels over which they may communicate. Packets arrive randomly at the transmitter's queue and wait to be successfully sent to the receiver. The transmitter may attempt a frame transmission on one channel at a time, where each frame includes a packet if one is in the queue. For each channel, an attempted transmission succeeds with an unknown probability. The transmitter's objective is to quickly identify the best channel so as to minimize the number of packets in the queue over T time slots. To analyze system performance, we introduce queue length regret, which is the expected difference between the total queue length under a learning policy and that under a controller that knows the channel success rates a priori. One approach to designing a transmission policy would be to apply algorithms from the literature that solve the closely related stochastic multi-armed bandit problem. These policies would focus on maximizing the number of successful frame transmissions over time. However, we show that these methods have Ω(log T) queue length regret. On the other hand, we show that there exists a set of queue-length-based policies that can obtain order-optimal O(1) queue length regret. We use our theoretical analysis to devise heuristic methods that are shown to perform well in simulation.
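The queue length regret described above can be estimated numerically. The following is a minimal sketch, not the paper's method: it assumes Bernoulli arrivals, a service-before-arrival ordering within each slot, and uses the standard UCB1 index as a stand-in for "algorithms from the literature". The names `simulate`, `ucb1_choice`, and `estimate_regret` are hypothetical, and the queue-length-based policies of the paper are not reproduced here.

```python
import math
import random


def ucb1_choice(pulls, wins, t):
    """Standard UCB1 index rule: try each channel once, then maximize mean + bonus."""
    for i, k in enumerate(pulls):
        if k == 0:
            return i
    return max(
        range(len(pulls)),
        key=lambda i: wins[i] / pulls[i] + math.sqrt(2 * math.log(t) / pulls[i]),
    )


def simulate(policy, arrival_rate, success_probs, horizon, seed):
    """Return the sum of queue lengths over `horizon` slots on one sample path.

    Assumptions (not from the source): Bernoulli(arrival_rate) arrivals, and
    within a slot the transmission is resolved before the new arrival joins.
    """
    rng = random.Random(seed)
    n = len(success_probs)
    pulls = [0] * n  # frames attempted on each channel
    wins = [0] * n   # successful frames on each channel
    best = max(range(n), key=lambda i: success_probs[i])
    queue = 0
    total = 0
    for t in range(1, horizon + 1):
        # Draw all randomness for the slot first, so that two policies run
        # with the same seed see identical arrivals and channel outcomes.
        arrival = rng.random() < arrival_rate
        outcomes = [rng.random() < p for p in success_probs]
        if policy == "genie":
            ch = best  # controller that knows the rates a priori
        elif policy == "ucb1":
            ch = ucb1_choice(pulls, wins, t)
        else:
            raise ValueError(f"unknown policy: {policy}")
        # A frame is sent every slot, so the learner observes the channel
        # even when the queue is empty.
        pulls[ch] += 1
        wins[ch] += outcomes[ch]
        if outcomes[ch] and queue > 0:
            queue -= 1
        queue += arrival
        total += queue
    return total


def estimate_regret(arrival_rate, success_probs, horizon, runs):
    """Monte Carlo estimate of queue length regret of UCB1 against the genie."""
    diff = 0
    for seed in range(runs):
        learner = simulate("ucb1", arrival_rate, success_probs, horizon, seed)
        genie = simulate("genie", arrival_rate, success_probs, horizon, seed)
        diff += learner - genie
    return diff / runs
```

A call such as `estimate_regret(0.4, [0.5, 0.8], 2000, 200)` averages the difference in total queue length over 200 coupled sample paths. The arrival rate and success probabilities in that call are illustrative values only. Sharing the seed between the two policies is a variance-reduction choice and is not required by the definition of regret.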


