Solving reward-collecting problems with UAVs: a comparison of online optimization and Q-learning

11/30/2021
by Yixuan Liu, et al.

Uncrewed autonomous vehicles (UAVs) have made significant contributions to reconnaissance and surveillance missions in past US military campaigns. As the prevalence of UAVs has increased, there have also been improvements in counter-UAV technology that make it difficult for UAVs to obtain valuable intelligence within an area of interest. Hence, it has become important for modern UAVs to accomplish their missions while maximizing their chances of survival. In this work, we study the problem of identifying a short path from a designated start to a goal on a grid, while collecting all rewards and avoiding adversaries that move randomly on the grid. We also describe a possible application of the framework in a military setting: autonomous casualty evacuation. We compare three methods for solving this problem: a Deep Q-Learning model, an ε-greedy tabular Q-Learning model, and an online optimization framework. Our computational experiments, designed using simple grid-world environments with random adversaries, showcase how these approaches work and compare them in terms of performance, accuracy, and computational time.
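To make the ε-greedy tabular Q-Learning approach concrete, the following is a minimal sketch of how it could be set up on a small grid world with one randomly moving adversary. It is not the authors' implementation. The grid size, start and goal cells, reward cells, reward magnitudes, adversary start cell, hyperparameters, and all function names (`move`, `step`, `epsilon_greedy`, `q_update`, `train`) are assumptions chosen for illustration.

```python
import random
from collections import defaultdict

# All sizes, rewards, and hyperparameters below are illustrative assumptions,
# not values taken from the paper.
SIZE = 4
START, GOAL = (0, 0), (3, 3)
REWARD_CELLS = ((0, 3), (3, 0))
MOVES = ((-1, 0), (1, 0), (0, -1), (0, 1))  # up, down, left, right


def move(pos, delta):
    """Shift pos by delta, clamping to the grid boundary."""
    r = min(max(pos[0] + delta[0], 0), SIZE - 1)
    c = min(max(pos[1] + delta[1], 0), SIZE - 1)
    return (r, c)


def step(state, action, rng):
    """Apply one agent move and one random adversary move.

    state = (agent position, bitmask of collected rewards, adversary position).
    Returns (next_state, reward, done).
    """
    agent, mask, adversary = state
    agent = move(agent, MOVES[action])
    adversary = move(adversary, rng.choice(MOVES))
    reward = -1.0  # per-step cost, so shorter paths score higher
    if agent == adversary:
        return (agent, mask, adversary), reward - 50.0, True  # caught
    for i, cell in enumerate(REWARD_CELLS):
        if agent == cell and not mask & (1 << i):
            mask |= 1 << i
            reward += 10.0
    all_collected = mask == (1 << len(REWARD_CELLS)) - 1
    if agent == GOAL and all_collected:
        return (agent, mask, adversary), reward + 50.0, True
    return (agent, mask, adversary), reward, False


def epsilon_greedy(q, state, epsilon, rng):
    """Explore with probability epsilon, otherwise pick the best-valued action."""
    if rng.random() < epsilon:
        return rng.randrange(len(MOVES))
    values = q[state]
    return max(range(len(MOVES)), key=values.__getitem__)


def q_update(q, state, action, reward, next_state, done, alpha, gamma):
    """Standard one-step Q-learning update."""
    target = reward if done else reward + gamma * max(q[next_state])
    q[state][action] += alpha * (target - q[state][action])


def train(episodes=2000, alpha=0.1, gamma=0.95, epsilon=0.1,
          max_steps=100, seed=0):
    """Run tabular Q-learning and return the learned Q-table."""
    rng = random.Random(seed)
    q = defaultdict(lambda: [0.0] * len(MOVES))
    for _ in range(episodes):
        state = (START, 0, (2, 2))  # adversary start cell is an assumption
        for _ in range(max_steps):
            action = epsilon_greedy(q, state, epsilon, rng)
            next_state, reward, done = step(state, action, rng)
            q_update(q, state, action, reward, next_state, done, alpha, gamma)
            state = next_state
            if done:
                break
    return q
```

In this sketch the state includes a bitmask of collected rewards, so the agent can tell whether reaching the goal completes the task, and the adversary's position, so the learned values can reflect the risk of nearby cells. The table grows with the number of reward cells and adversaries, which is one reason a function approximator such as a Deep Q-Learning model may be preferred on larger grids.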


