Risk-Sensitive Reinforcement Learning for URLLC Traffic in Wireless Networks

11/06/2018
by   Nesrine Ben-Khalifa, et al.
0

In this paper, we study the problem of dynamic channel allocation for URLLC traffic in a multi-user multi-channel wireless network where urgent packets have to be successfully transmitted in a timely manner. We formulate the problem as a finite-horizon Markov Decision Process with a stochastic constraint related to the QoS requirement, defined as the packet loss rate for each user. We propose a novel weighted formulation that takes into account both the total expected reward (number of successfully transmitted packets) and the risk which we define as the QoS requirement violation. First, we use the value iteration algorithm to find the optimal policy, which assumes a perfect knowledge of the controller of all the parameters, namely the channel statistics. We then propose a Q-learning algorithm where the controller learns the optimal policy without having knowledge of neither the CSI nor the channel statistics. We illustrate the performance of our algorithms with numerical studies.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/27/2017

A Simple Reinforcement Learning Mechanism for Resource Allocation in LTE-A Networks with Markov Decision Process and Q-Learning

Resource allocation is still a difficult issue to deal with in wireless ...
research
11/27/2018

Optimal Learning for Dynamic Coding in Deadline-Constrained Multi-Channel Networks

We study the problem of serving randomly arriving and delay-sensitive tr...
research
02/20/2018

Deep Reinforcement Learning for Dynamic Multichannel Access in Wireless Networks

We consider a dynamic multichannel access problem, where multiple correl...
research
10/21/2022

Optimal Policies for Age and Distortion in a Discrete-Time Model

We study a discrete-time model where each packet has a cost of not being...
research
05/27/2020

Wireless Feedback Control with Variable Packet Length for Industrial IoT

The paper considers a wireless networked control system (WNCS), where a ...
research
09/17/2020

Knowledge-Assisted Deep Reinforcement Learning in 5G Scheduler Design: From Theoretical Framework to Implementation

In this paper, we develop a knowledge-assisted deep reinforcement learni...
research
07/21/2020

Flow Sampling: Accurate and Load-balanced Sampling Policies

Software-defined networking simplifies network monitoring by means of pe...

Please sign up or login with your details

Forgot password? Click here to reset