Model-free Reinforcement Learning of Semantic Communication by Stochastic Policy Gradient

05/05/2023
by   Edgar Beck, et al.
0

Motivated by the recent success of Machine Learning tools in wireless communications, the idea of semantic communication by Weaver from 1949 has gained attention. It breaks with Shannon's classic design paradigm by aiming to transmit the meaning, i.e., semantics, of a message instead of its exact version, allowing for information rate savings. In this work, we apply the Stochastic Policy Gradient (SPG) to design a semantic communication system by reinforcement learning, not requiring a known or differentiable channel model - a crucial step towards deployment in practice. Further, we motivate the use of SPG for both classic and semantic communication from the maximization of the mutual information between received and target variables. Numerical results show that our approach achieves comparable performance to a model-aware approach based on the reparametrization trick, albeit with a decreased convergence rate.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/28/2022

Semantic Communication: An Information Bottleneck View

Motivated by recent success of machine learning tools at the PHY layer a...
research
06/01/2021

Reinforce Security: A Model-Free Approach Towards Secure Wiretap Coding

The use of deep learning-based techniques for approximating secure encod...
research
05/15/2022

Policy Gradient Method For Robust Reinforcement Learning

This paper develops the first policy gradient method with global optimal...
research
03/01/2020

A Hybrid Stochastic Policy Gradient Algorithm for Reinforcement Learning

We propose a novel hybrid stochastic policy gradient estimator by combin...
research
10/01/2021

What is Semantic Communication? A View on Conveying Meaning in the Era of Machine Intelligence

In 1940s, Claude Shannon developed the information theory focusing on qu...
research
06/30/2021

Inverse Design of Grating Couplers Using the Policy Gradient Method from Reinforcement Learning

We present a proof-of-concept technique for the inverse design of electr...

Please sign up or login with your details

Forgot password? Click here to reset