Continual Meta-Reinforcement Learning for UAV-Aided Vehicular Wireless Networks

07/13/2022
by   Riccardo Marini, et al.
0

Unmanned aerial base stations (UABSs) can be deployed in vehicular wireless networks to support applications such as extended sensing via vehicle-to-everything (V2X) services. A key problem in such systems is designing algorithms that can efficiently optimize the trajectory of the UABS in order to maximize coverage. In existing solutions, such optimization is carried out from scratch for any new traffic configuration, often by means of conventional reinforcement learning (RL). In this paper, we propose the use of continual meta-RL as a means to transfer information from previously experienced traffic configurations to new conditions, with the goal of reducing the time needed to optimize the UABS's policy. Adopting the Continual Meta Policy Search (CoMPS) strategy, we demonstrate significant efficiency gains as compared to conventional RL, as well as to naive transfer learning methods.

READ FULL TEXT
research
12/08/2021

CoMPS: Continual Meta Policy Search

We develop a new continual meta-learning method to address challenges in...
research
10/01/2022

Integrating Conventional Headway Control with Reinforcement Learning to Avoid Bus Bunching

Bus bunching is a natural-occurring phenomenon that undermines the effic...
research
06/05/2023

Catch Me If You Can: Deep Meta-RL for Search-and-Rescue using LoRa UAV Networks

Long range (LoRa) wireless networks have been widely proposed as a effic...
research
07/22/2019

VRLS: A Unified Reinforcement Learning Scheduler for Vehicle-to-Vehicle Communications

Vehicle-to-vehicle (V2V) communications have distinct challenges that ne...
research
10/21/2022

Continual Reinforcement Learning with Group Symmetries

Continual reinforcement learning (RL) aims to learn a sequence of tasks ...
research
05/23/2021

Continual World: A Robotic Benchmark For Continual Reinforcement Learning

Continual learning (CL) – the ability to continuously learn, building on...
research
04/27/2020

FORECASTER: A Continual Lifelong Learning Approach to Improve Hardware Efficiency

Computer applications are continuously evolving. However, significant kn...

Please sign up or login with your details

Forgot password? Click here to reset