Optimal Learning for Dynamic Coding in Deadline-Constrained Multi-Channel Networks

11/27/2018
by   Semih Cayci, et al.
0

We study the problem of serving randomly arriving and delay-sensitive traffic over a multi-channel communication system with time-varying channel states and unknown statistics. This problem deviates from the classical exploration-exploitation setting in that the design and analysis must accommodate the dynamics of packet availability and urgency as well as the cost of each channel use at the time of decision. To that end, we have developed and investigated an index-based policy UCB-Deadline, which performs dynamic channel allocation decisions that incorporate these traffic requirements and costs. Under symmetric channel conditions, we have proved that the UCB-Deadline policy can achieve bounded regret in the likely case where the cost of using a channel is not too high to prevent all transmissions, and logarithmic regret otherwise. In this case, we show that UCB-Deadline is order-optimal. We also perform numerical investigations to validate the theoretical findings, and also compare the performance of the UCB-Deadline to another learning algorithm that we propose based on Thompson Sampling.

READ FULL TEXT
research
11/06/2018

Risk-Sensitive Reinforcement Learning for URLLC Traffic in Wireless Networks

In this paper, we study the problem of dynamic channel allocation for UR...
research
02/15/2019

Heat-Diffusion: Pareto Optimal Dynamic Routing for Time-Varying Wireless Networks

A dynamic routing policy, referred to as Heat-Diffusion (HD), is develop...
research
02/20/2018

Deep Reinforcement Learning for Dynamic Multichannel Access in Wireless Networks

We consider a dynamic multichannel access problem, where multiple correl...
research
07/16/2022

Sampling of the Wiener Process for Remote Estimation over a Channel with Unknown Delay Statistics

In this paper, we study an online sampling problem of the Wiener process...
research
10/12/2019

Thompson Sampling in Non-Episodic Restless Bandits

Restless bandit problems assume time-varying reward distributions of the...
research
04/11/2018

Cost-Aware Learning and Optimization for Opportunistic Spectrum Access

In this paper, we investigate cost-aware joint learning and optimization...
research
08/03/2009

Regret Bounds for Opportunistic Channel Access

We consider the task of opportunistic channel access in a primary system...

Please sign up or login with your details

Forgot password? Click here to reset