DL-DRL: A double-layer deep reinforcement learning approach for large-scale task scheduling of multi-UAV

08/04/2022
by   Xiao Mao, et al.
5

This paper studies deep reinforcement learning (DRL) for the task scheduling problem of multiple unmanned aerial vehicles (UAVs). Current approaches generally use exact and heuristic algorithms to solve the problem, while the computation time rapidly increases as the task scale grows and heuristic rules need manual design. As a self-learning method, DRL can obtain a high-quality solution quickly without hand-engineered rules. However, the huge decision space makes the training of DRL models becomes unstable in situations with large-scale tasks. In this work, to address the large-scale problem, we develop a divide and conquer-based framework (DCF) to decouple the original problem into a task allocation and a UAV route planning subproblems, which are solved in the upper and lower layers, respectively. Based on DCF, a double-layer deep reinforcement learning approach (DL-DRL) is proposed, where an upper-layer DRL model is designed to allocate tasks to appropriate UAVs and a lower-layer DRL model [i.e., the widely used attention model (AM)] is applied to generate viable UAV routes. Since the upper-layer model determines the input data distribution of the lower-layer model, and its reward is calculated via the lower-layer model during training, we develop an interactive training strategy (ITS), where the whole training process consists of pre-training, intensive training, and alternate training processes. Experimental results show that our DL-DRL outperforms mainstream learning-based and most traditional methods, and is competitive with the state-of-the-art heuristic method [i.e., OR-Tools], especially on large-scale problems. The great generalizability of DL-DRL is also verified by testing the model learned for a problem size to larger ones. Furthermore, an ablation study demonstrates that our ITS can reach a compromise between the model performance and training duration.

READ FULL TEXT

page 2

page 3

page 4

page 5

page 6

page 7

page 10

page 11

research
09/17/2020

SREC: Proactive Self-Remedy of Energy-Constrained UAV-Based Networks via Deep Reinforcement Learning

Energy-aware control for multiple unmanned aerial vehicles (UAVs) is one...
research
09/05/2023

Personalized Federated Deep Reinforcement Learning-based Trajectory Optimization for Multi-UAV Assisted Edge Computing

In the era of 5G mobile communication, there has been a significant surg...
research
02/05/2022

Reinforcement learning for multi-item retrieval in the puzzle-based storage system

Nowadays, fast delivery services have created the need for high-density ...
research
04/21/2021

Model-aided Deep Reinforcement Learning for Sample-efficient UAV Trajectory Design in IoT Networks

Deep Reinforcement Learning (DRL) is gaining attention as a potential ap...
research
08/06/2020

Deep Reinforcement Learning based Local Planner for UAV Obstacle Avoidance using Demonstration Data

In this paper, a deep reinforcement learning (DRL) method is proposed to...
research
09/06/2023

Learning to Recharge: UAV Coverage Path Planning through Deep Reinforcement Learning

Coverage path planning (CPP) is a critical problem in robotics, where th...
research
05/18/2023

Automatic Design Method of Building Pipeline Layout Based on Deep Reinforcement Learning

The layout design of pipelines is a critical task in the construction in...

Please sign up or login with your details

Forgot password? Click here to reset