RSS-Based UAV-BS 3-D Mobility Management via Policy Gradient Deep Reinforcement Learning

We address the mobility management of an autonomous UAV-mounted base station (UAV-BS) that provides communication services to a cluster of users on the ground while the geographical characteristics (e.g., location and boundary) of the cluster, the geographical locations of the users, and the characteristics of the radio environment are unknown. UAVBS solely exploits the received signal strengths (RSS) from the users and accordingly chooses its (continuous) 3-D speed to constructively navigate, i.e., improving the transmitted data rate. To compensate for the lack of a model, we adopt policy gradient deep reinforcement learning. As our approach does not rely on any particular information about the users as well as the radio environment, it is flexible and respects the privacy concerns. Our experiments indicate that despite the minimum available information the UAV-BS is able to distinguish between high-rise (often non-line-of-sight dominant) and sub-urban (mainly line-of-sight dominant) environments such that in the former (resp. latter) it tends to reduce (resp. increase) its height and stays close (resp. far) to the cluster. We further observe that the choice of the reward function affects the speed and the ability of the agent to adhere to the problem constraints without affecting the delivered data rate.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/21/2020

Autonomous UAV Navigation: A DDPG-based Deep Reinforcement Learning Approach

In this paper, we propose an autonomous UAV path planning framework usin...
research
12/14/2021

Autonomous Navigation and Configuration of Integrated Access Backhauling for UAV Base Station Using Reinforcement Learning

Fast and reliable connectivity is essential to enhancing situational awa...
research
02/04/2022

5G Network on Wings: A Deep Reinforcement Learning Approach to UAV-based Integrated Access and Backhaul

Fast and reliable wireless communication has become a critical demand in...
research
10/02/2020

REQIBA: Regression and Deep Q-Learning for Intelligent UAV Cellular User to Base Station Association

Unmanned Aerial Vehicles (UAVs) are emerging as important users of next-...
research
08/02/2021

Three-Dimensional Trajectory Design for Multi-User MISO UAV Communications: A Deep Reinforcement Learning Approach

In this paper, we investigate a multi-user downlink multiple-input singl...
research
07/27/2020

Adaptive Height Optimisation for Cellular-Connected UAVs using Reinforcement Learning

With the increasing number of uav as users of the cellular network, the ...

Please sign up or login with your details

Forgot password? Click here to reset