Distributed Proximal Policy Optimization for Contention-Based Spectrum Access

10/07/2021
by Akash Doshi, et al.

The increasing number of wireless devices operating in unlicensed spectrum motivates the development of intelligent, adaptive approaches to spectrum access that go beyond traditional carrier sensing. We develop a novel distributed implementation of a policy gradient method known as Proximal Policy Optimization, modelled on a two-stage Markov decision process, that enables such an intelligent approach while still achieving decentralized contention-based medium access. In each time slot, a base station (BS) uses information from spectrum sensing and reception quality to autonomously decide whether to transmit on a given resource, with the goal of maximizing network-wide proportional fairness. Empirically, we find that the proportional fairness reward accumulated by the policy gradient approach is significantly higher than that of even a genie-aided adaptive energy detection threshold. This is further validated by the improved sum and maximum user throughputs achieved by our approach.
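As a rough illustration of the two quantities the abstract names, the sketch below computes a proportional fairness utility in its common textbook form (the sum of the logarithms of per-user average throughputs) and the standard clipped surrogate objective of Proximal Policy Optimization. The abstract does not give the paper's exact reward definition, state representation, or hyperparameters, so the function names, the throughput values, and the clipping parameter `eps=0.2` are assumptions made purely for illustration.

```python
import math


def proportional_fairness(avg_throughputs):
    """Textbook proportional-fairness utility: sum of log average throughputs.

    Assumption: this is the common form of the metric; the paper's exact
    reward definition is not stated in the abstract.
    """
    return sum(math.log(r) for r in avg_throughputs)


def ppo_clipped_objective(ratios, advantages, eps=0.2):
    """Standard PPO clipped surrogate, averaged over samples.

    ratios:     new-policy / old-policy action probabilities per sample
    advantages: advantage estimates per sample
    eps:        clipping parameter (0.2 is an illustrative assumption)
    """
    terms = []
    for r, a in zip(ratios, advantages):
        # Clamp the probability ratio to [1 - eps, 1 + eps].
        clipped = max(1.0 - eps, min(r, 1.0 + eps))
        # Take the more pessimistic of the unclipped and clipped terms.
        terms.append(min(r * a, clipped * a))
    return sum(terms) / len(terms)


if __name__ == "__main__":
    # Hypothetical throughputs: the even split scores higher than the skewed one.
    print(proportional_fairness([2.0, 2.0]), proportional_fairness([1.0, 3.0]))
    # A large ratio with positive advantage is capped at (1 + eps) * advantage.
    print(ppo_clipped_objective([1.5], [1.0]))
```

The logarithm is what makes the utility favor balanced allocations: with the same total throughput, an even split between two users scores higher than a skewed one, which is why maximizing it network-wide trades off sum throughput against fairness.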

research
09/24/2021

Distributed Deep Reinforcement Learning for Adaptive Medium Access and Modulation in Shared Spectrum

Spectrum scarcity has led to growth in the use of unlicensed spectrum fo...
research
10/05/2021

A Deep Reinforcement Learning Framework for Contention-Based Spectrum Sharing

The increasing number of wireless devices operating in unlicensed spectr...
research
07/14/2021

Learning-based Spectrum Sensing and Access in Cognitive Radios via Approximate POMDPs

A novel LEarning-based Spectrum Sensing and Access (LESSA) framework is ...
research
08/24/2018

Proximal Policy Optimization and its Dynamic Version for Sequence Generation

In sequence generation task, many works use policy gradient for model op...
research
03/25/2018

Optimal Spectrum Sensing Policy with Traffic Classification in RF-Powered CRNs

An orthogonal frequency division multiple access (OFDMA)-based primary u...
research
04/06/2020

Multi-Agent Deep Stochastic Policy Gradient for Event Based Dynamic Spectrum Access

We consider the dynamic spectrum access (DSA) problem where K Internet o...
research
01/24/2019

On the Complexity of Approximating Wasserstein Barycenter

We study the complexity of approximating Wasserstein barycenter of m disc...
