Q-learning algorithm for resource allocation in WDMA-based optical wireless communication networks

05/22/2021
by Abdelrahman S. Elgamal, et al.

Visible Light Communication (VLC) has been widely investigated during the last decade due to its ability to provide high data rates with low power consumption. Resource management is an important issue in cellular networks and can strongly affect their performance. In this paper, an optimisation problem is formulated to assign each user to an access point and a wavelength at a given time. This problem can be solved using mixed integer linear programming (MILP). However, MILP is not considered a practical solution because of its complexity and memory requirements, and because it needs accurate system information to perform the resource allocation. Therefore, the optimisation problem is reformulated using reinforcement learning (RL), which has recently received considerable interest due to its ability to interact with an environment without prior knowledge of it. We investigate solving the resource allocation optimisation problem in VLC systems using the basic Q-learning algorithm. Two scenarios are simulated and the results are compared with the previously proposed MILP model. The results demonstrate that the Q-learning algorithm can provide solutions close to those of the MILP model without prior knowledge of the system.
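To make the idea of learning an assignment by trial and error concrete, the following is a minimal sketch of tabular Q-learning on a toy allocation problem. It is not the paper's model: the rate table, the sequential one-user-at-a-time formulation, the zero reward for a reused (access point, wavelength) pair, and all names (`RATES`, `step`, `train`, `greedy_assignment`, `total_rate`) are assumptions made purely for illustration. A brute-force search stands in for the MILP baseline.

```python
import itertools
import random

# Hypothetical per-user rates (arbitrary units), NOT taken from the paper.
# Columns are the (access point, wavelength) pairs:
# (AP0, w0), (AP0, w1), (AP1, w0), (AP1, w1).
RATES = [
    [9, 4, 2, 1],
    [8, 7, 3, 2],
    [3, 2, 6, 5],
]
N_USERS = len(RATES)
N_RESOURCES = len(RATES[0])


def step(state, action):
    """Assign the next user to `action`.

    Assumption: a resource that is already taken yields zero reward.
    """
    user = len(state)
    reward = 0 if action in state else RATES[user][action]
    return state + (action,), reward


def train(episodes=20000, alpha=0.5, gamma=1.0, epsilon=0.5, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration."""
    rng = random.Random(seed)
    q = {}  # state (tuple of earlier assignments) -> list of action values

    def values(state):
        return q.setdefault(state, [0.0] * N_RESOURCES)

    for _ in range(episodes):
        state = ()
        while len(state) < N_USERS:
            if rng.random() < epsilon:
                action = rng.randrange(N_RESOURCES)  # explore
            else:
                row = values(state)
                action = max(range(N_RESOURCES), key=row.__getitem__)  # exploit
            next_state, reward = step(state, action)
            terminal = len(next_state) == N_USERS
            future = 0.0 if terminal else gamma * max(values(next_state))
            # Standard Q-learning update towards reward + discounted best next value.
            values(state)[action] += alpha * (reward + future - values(state)[action])
            state = next_state
    return q


def greedy_assignment(q):
    """Follow the highest-valued action from the empty state."""
    state = ()
    while len(state) < N_USERS:
        row = q.get(state, [0.0] * N_RESOURCES)
        state += (max(range(N_RESOURCES), key=row.__getitem__),)
    return state


def total_rate(assignment):
    """Sum of rewards obtained by replaying an assignment through `step`."""
    state, total = (), 0
    for action in assignment:
        state, reward = step(state, action)
        total += reward
    return total


q_table = train()
learned = greedy_assignment(q_table)
# Exhaustive search as a stand-in for an exact solver on this tiny instance.
best = max(itertools.product(range(N_RESOURCES), repeat=N_USERS), key=total_rate)
print("learned:", learned, "rate:", total_rate(learned))
print("exhaustive:", best, "rate:", total_rate(best))
```

The agent only ever sees the reward returned by `step`, never the rate table itself, which mirrors the abstract's point about operating without prior knowledge of the system. The exhaustive search is feasible here only because the toy instance has 64 candidate assignments; the table of Q-values also grows quickly with the number of users in this naive state encoding.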

research
06/21/2021

Reinforcement Learning for Resource Allocation in Steerable Laser-based Optical Wireless Systems

Vertical Cavity Surface Emitting Lasers (VCSELs) have demonstrated suita...
research
09/28/2020

Efficient Resource Allocation through Integer Linear Programming: a detailed example

In this paper, we show how a resource allocation problem can be solved t...
research
12/15/2020

Bayesian Optimization for Radio Resource Management: Open Loop Power Control

The purpose of this paper is to provide the reader with an accessible ye...
research
04/10/2019

Optimisation of stochastic networks with blocking: a functional-form approach

Many stochastic networks encountered in practice exhibit some kind of bl...
research
01/27/2021

GymD2D: A Device-to-Device Underlay Cellular Offload Evaluation Platform

Cellular offloading in device-to-device communication is a challenging o...
research
03/03/2020

Accelerating Generalized Benders Decomposition for Wireless Resource Allocation

Generalized Benders decomposition (GBD) is a globally optimal algorithm ...
research
02/13/2020

Mode Selection and Resource Allocation in Sliced Fog Radio Access Networks: A Reinforcement Learning Approach

The mode selection and resource allocation in fog radio access networks ...
