Q-learning algorithm for resource allocation in WDMA-based optical wireless communication networks

by   Abdelrahman S. Elgamal, et al.

Visible Light Communication (VLC) has been widely investigated during the last decade due to its ability to provide high data rates with low power consumption. In general, resource management is an important issue in cellular networks that can highly effect their performance. In this paper, an optimisation problem is formulated to assign each user to an optimal access point and a wavelength at a given time. This problem can be solved using mixed integer linear programming (MILP). However, using MILP is not considered a practical solution due to its complexity and memory requirements. In addition, accurate information must be provided to perform the resource allocation. Therefore, the optimisation problem is reformulated using reinforcement learning (RL), which has recently received tremendous interest due to its ability to interact with any environment without prior knowledge. In this paper, we investigate solving the resource allocation optimisation problem in VLC systems using the basic Q-learning algorithm. Two scenarios are simulated to compare the results with the previously proposed MILP model. The results demonstrate the ability of the Q-learning algorithm to provide optimal solutions close to the MILP model without prior knowledge of the system.


Reinforcement Learning for Resource Allocation in Steerable Laser-based Optical Wireless Systems

Vertical Cavity Surface Emitting Lasers (VCSELs) have demonstrated suita...

Efficient Resource Allocation through Integer Linear Programming: a detailed example

In this paper, we show how a resource allocation problem can be solved t...

Bayesian Optimization for Radio Resource Management: Open Loop Power Control

The purpose of this paper is to provide the reader with an accessible ye...

Optimisation of stochastic networks with blocking: a functional-form approach

Many stochastic networks encountered in practice exhibit some kind of bl...

GymD2D: A Device-to-Device Underlay Cellular Offload Evaluation Platform

Cellular offloading in device-to-device communication is a challenging o...

Accelerating Generalized Benders Decomposition for Wireless Resource Allocation

Generalized Benders decomposition (GBD) is a globally optimal algorithm ...

Mode Selection and Resource Allocation in Sliced Fog Radio Access Networks: A Reinforcement Learning Approach

The mode selection and resource allocation in fog radio access networks ...

Please sign up or login with your details

Forgot password? Click here to reset