Investigation of risk-aware MDP and POMDP contingency management autonomy for UAS

04/03/2023
by   Prashin Sharma, et al.
0

Unmanned aircraft systems (UAS) are being increasingly adopted for various applications. The risk UAS poses to people and property must be kept to acceptable levels. This paper proposes risk-aware contingency management autonomy to prevent an accident in the event of component malfunction, specifically propulsion unit failure and/or battery degradation. The proposed autonomy is modeled as a Markov Decision Process (MDP) whose solution is a contingency management policy that appropriately executes emergency landing, flight termination or continuation of planned flight actions. Motivated by the potential for errors in fault/failure indicators, partial observability of the MDP state space is investigated. The performance of optimal policies is analyzed over varying observability conditions in a high-fidelity simulator. Results indicate that both partially observable MDP (POMDP) and maximum a posteriori MDP policies performed similarly over different state observability criteria, given the nearly deterministic state transition model.

READ FULL TEXT

page 6

page 9

page 13

page 19

research
01/16/2013

PEGASUS: A Policy Search Method for Large MDPs and POMDPs

We propose a new approach to the problem of searching a space of policie...
research
09/20/2018

Logically-Constrained Neural Fitted Q-Iteration

This paper proposes a method for efficient training of the Q-function fo...
research
09/13/2019

Reinforcement Learning: a Comparison of UCB Versus Alternative Adaptive Policies

In this paper we consider the basic version of Reinforcement Learning (R...
research
04/04/2021

Reinforcement Learning with Temporal Logic Constraints for Partially-Observable Markov Decision Processes

This paper proposes a reinforcement learning method for controller synth...
research
03/28/2017

Fast Optimization of Wildfire Suppression Policies with SMAC

Managers of US National Forests must decide what policy to apply for dea...
research
10/17/2022

Risk-Sensitive Markov Decision Processes with Long-Run CVaR Criterion

CVaR (Conditional Value at Risk) is a risk metric widely used in finance...
research
03/02/2021

Prognostics-Informed Battery Reconfiguration in a Multi-Battery Small UAS Energy System

Batteries have been identified as one most likely small UAS (sUAS) compo...

Please sign up or login with your details

Forgot password? Click here to reset