Deep Reinforcement Learning for Dynamic Multichannel Access in Wireless Networks

02/20/2018
by Shangxing Wang, et al.

We consider a dynamic multichannel access problem in which multiple correlated channels follow an unknown joint Markov model. In each time slot, a user selects a channel on which to transmit data and receives a reward based on the success or failure of the transmission. The objective is to find a policy that maximizes the expected long-term reward. The problem is formulated as a partially observable Markov decision process (POMDP) with unknown system dynamics. To overcome the challenges of unknown system dynamics and prohibitive computation, we apply reinforcement learning and implement a Deep Q-Network (DQN) that can handle a large state space without any prior knowledge of the system dynamics. We provide an analytical study of the optimal policy for fixed-pattern channel switching with known system dynamics and show through simulations that DQN can achieve the same optimal performance without knowing the system statistics. We compare the performance of DQN with a Myopic policy and a Whittle Index-based heuristic using both simulations and a real-data trace, and show that DQN achieves near-optimal performance in more complex situations. Finally, we propose an adaptive DQN approach that can adapt its learning in time-varying, dynamic scenarios.
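To make the setting concrete, here is a minimal sketch of a fixed-pattern channel-switching environment. It is a toy illustration, not the paper's DQN or its analytical policy. Everything specific in it is an assumption: exactly one channel is good per slot, the good channel advances round-robin with probability `p_switch`, the reward is +1 for success and -1 for failure, and the names `simulate`, `random_policy`, and `follow_pattern_policy` are hypothetical. The user sees only the outcome on the channel it picked, which is what makes the problem partially observable.

```python
import random


def simulate(policy, n_channels=4, p_switch=0.9, steps=10_000, seed=0):
    """Return the average reward per slot of `policy` on a toy channel model.

    Assumed model (not taken from the paper): one good channel at a time,
    which moves to the next index with probability `p_switch` each slot.
    """
    rng = random.Random(seed)
    good = rng.randrange(n_channels)  # hidden state: index of the good channel
    last_choice, last_reward = None, None
    total = 0
    for _ in range(steps):
        choice = policy(rng, n_channels, last_choice, last_reward)
        reward = 1 if choice == good else -1  # assumed +1 / -1 reward
        total += reward
        last_choice, last_reward = choice, reward
        # Hidden transition: the good channel advances round-robin or stays.
        if rng.random() < p_switch:
            good = (good + 1) % n_channels
    return total / steps


def random_policy(rng, n_channels, last_choice, last_reward):
    """Baseline: pick a channel uniformly at random in every slot."""
    return rng.randrange(n_channels)


def follow_pattern_policy(rng, n_channels, last_choice, last_reward):
    """Illustrative heuristic that exploits the known switching pattern."""
    if last_choice is None:
        return rng.randrange(n_channels)
    if last_reward == 1:
        # The last channel was good, so the next one in the pattern is likely good.
        return (last_choice + 1) % n_channels
    # After a failure, wait on the same channel for the good state to arrive.
    return last_choice


if __name__ == "__main__":
    print("random:        ", simulate(random_policy))
    print("follow pattern:", simulate(follow_pattern_policy))
```

A learning agent such as a DQN would replace `follow_pattern_policy` and would have to discover the switching pattern from observed rewards alone, since it is not told `p_switch` or the channel order.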


research
10/08/2018

Actor-Critic Deep Reinforcement Learning for Dynamic Multichannel Access

We consider the dynamic multichannel access problem, which can be formul...
research
10/24/2021

Deep Reinforcement Learning for Simultaneous Sensing and Channel Access in Cognitive Networks

We consider the problem of dynamic spectrum access (DSA) in cognitive wi...
research
07/02/2019

A Reinforcement Learning Approach for the Multichannel Rendezvous Problem

In this paper, we consider the multichannel rendezvous problem in cognit...
research
11/06/2018

Risk-Sensitive Reinforcement Learning for URLLC Traffic in Wireless Networks

In this paper, we study the problem of dynamic channel allocation for UR...
research
11/27/2018

Optimal Learning for Dynamic Coding in Deadline-Constrained Multi-Channel Networks

We study the problem of serving randomly arriving and delay-sensitive tr...
research
09/23/2021

Opportunistic Spectrum Access: Does Maximizing Throughput Minimize File Transfer Time?

The Opportunistic Spectrum Access (OSA) model has been developed for the...
research
07/09/2022

Optimal policies for Bayesian olfactory search in turbulent flows

In many practical scenarios, a flying insect must search for the source ...
