Model-aided Federated Reinforcement Learning for Multi-UAV Trajectory Planning in IoT Networks

06/03/2023
by Jichao Chen, et al.

Deploying teams of cooperative unmanned aerial vehicles (UAVs) to harvest data from distributed Internet of Things (IoT) devices requires efficient trajectory planning and coordination algorithms. Multi-agent reinforcement learning (MARL) has emerged as an effective solution, but it often requires extensive and costly real-world training data. In this paper, we propose a novel model-aided federated MARL algorithm that coordinates multiple UAVs on a data harvesting mission with limited knowledge about the environment, significantly reducing the demand for real-world training data. The proposed algorithm alternates between learning an environment model from real-world measurements and federated QMIX training in the simulated environment. Specifically, measurements collected in the real-world environment are used to learn the radio channel and to estimate the unknown IoT device locations, which together define a simulated environment. Each UAV agent trains a local QMIX model in its simulated environment and continuously consolidates it with the models of other agents through federated learning, accelerating the learning process and further improving training sample efficiency. Simulation results demonstrate that the proposed model-aided FedQMIX algorithm substantially reduces the need for real-world training experiences while attaining data collection performance comparable to that of standard MARL algorithms.
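
The two alternating steps described above can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state which channel model or aggregation rule is used, so the log-distance path loss fit and the equal-weight parameter averaging below are assumptions, and the function names `fit_log_distance_channel` and `federated_average` are hypothetical. QMIX training itself is omitted; each agent's model is represented by a flat list of parameters.

```python
import math


def fit_log_distance_channel(distances_m, rx_power_db):
    """Least-squares fit of rx_power_db ~ intercept - 10 * n * log10(d).

    ASSUMPTION: a log-distance path loss model; the abstract only says the
    radio channel is learned from real-world measurements.
    Returns (intercept_db, path_loss_exponent).
    """
    xs = [math.log10(d) for d in distances_m]
    count = len(xs)
    mean_x = sum(xs) / count
    mean_y = sum(rx_power_db) / count
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0.0:
        raise ValueError("need measurements at two or more distinct distances")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, rx_power_db))
    slope = sxy / sxx                      # equals -10 * n in the model
    intercept = mean_y - slope * mean_x
    return intercept, -slope / 10.0


def federated_average(local_params):
    """Element-wise mean of the agents' parameter vectors.

    ASSUMPTION: plain equal-weight averaging; the abstract does not specify
    how the local QMIX models are consolidated.
    """
    n_agents = len(local_params)
    return [sum(values) / n_agents for values in zip(*local_params)]


# Synthetic, noise-free measurements generated from a known channel
# (illustrative values, not taken from the paper).
true_intercept_db, true_exponent = -30.0, 2.2
distances = [10.0, 20.0, 50.0, 100.0, 200.0]
powers = [true_intercept_db - 10.0 * true_exponent * math.log10(d) for d in distances]

intercept_db, exponent = fit_log_distance_channel(distances, powers)

# Three agents, each holding a locally trained parameter vector.
global_params = federated_average([[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]])
```

In a full loop, the fitted channel parameters would configure each agent's simulated environment, the agents would train locally in simulation, and the averaged parameters would be sent back to every agent before the next round of real-world measurements.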


research
07/27/2023

Multi-Agent Graph Reinforcement Learning based On-Demand Wireless Energy Transfer in Multi-UAV-aided IoT Network

This paper proposes a new on-demand wireless energy transfer (WET) schem...
research
04/21/2021

Model-aided Deep Reinforcement Learning for Sample-efficient UAV Trajectory Design in IoT Networks

Deep Reinforcement Learning (DRL) is gaining attention as a potential ap...
research
10/23/2020

Multi-UAV Path Planning for Wireless Data Harvesting with Deep Reinforcement Learning

Harvesting data from distributed Internet of Things (IoT) devices with m...
research
12/01/2021

Joint Cluster Head Selection and Trajectory Planning in UAV-Aided IoT Networks by Reinforcement Learning with Sequential Model

Employing unmanned aerial vehicles (UAVs) has attracted growing interest...
research
07/01/2020

UAV Path Planning for Wireless Data Harvesting: A Deep Reinforcement Learning Approach

Autonomous deployment of unmanned aerial vehicles (UAVs) supporting next...
research
10/10/2022

FedBA: Non-IID Federated Learning Framework in UAV Networks

With the development and progress of science and technology, the Interne...
research
03/20/2018

Cooperative and Distributed Reinforcement Learning of Drones for Field Coverage

This paper proposed a distributed Multi-Agent Reinforcement Learning (MA...
