Reinforcement Learning for Physical Layer Communications

06/22/2021
by   Philippe Mary, et al.
0

In this chapter, we will give comprehensive examples of applying RL in optimizing the physical layer of wireless communications by defining different class of problems and the possible solutions to handle them. In Section 9.2, we present all the basic theory needed to address a RL problem, i.e. Markov decision process (MDP), Partially observable Markov decision process (POMDP), but also two very important and widely used algorithms for RL, i.e. the Q-learning and SARSA algorithms. We also introduce the deep reinforcement learning (DRL) paradigm and the section ends with an introduction to the multi-armed bandits (MAB) framework. Section 9.3 focuses on some toy examples to illustrate how the basic concepts of RL are employed in communication systems. We present applications extracted from literature with simplified system models using similar notation as in Section 9.2 of this Chapter. In Section 9.3, we also focus on modeling RL problems, i.e. how action and state spaces and rewards are chosen. The Chapter is concluded in Section 9.4 with a prospective thought on RL trends and it ends with a review of a broader state of the art in Section 9.5.

READ FULL TEXT

page 27

page 29

page 30

page 32

research
05/11/2018

Deep Hierarchical Reinforcement Learning Algorithm in Partially Observable Markov Decision Processes

In recent years, reinforcement learning has achieved many remarkable suc...
research
01/15/2021

Reinforcement learning based recommender systems: A survey

Recommender systems (RSs) are becoming an inseparable part of our everyd...
research
07/10/2022

An Introduction to Lifelong Supervised Learning

This primer is an attempt to provide a detailed summary of the different...
research
02/10/2023

A Survey on Causal Reinforcement Learning

While Reinforcement Learning (RL) achieves tremendous success in sequent...
research
05/19/2019

Reinforcement Learning for Learning of Dynamical Systems in Uncertain Environment: a Tutorial

In this paper, a review of model-free reinforcement learning for learnin...
research
08/06/2020

A Gentle Lecture Note on Filtrations in Reinforcement Learning

This note aims to provide a basic intuition on the concept of filtration...
research
06/05/2023

An Interpretive Framework for Narrower Immunity Under Section 230 of the Communications Decency Act

Almost all courts to interpret Section 230 of the Communications Decency...

Please sign up or login with your details

Forgot password? Click here to reset