Adaptive Risk Tendency: Nano Drone Navigation in Cluttered Environments with Distributional Reinforcement Learning

03/28/2022
by   Cheng Liu, et al.
0

Enabling robots with the capability of assessing risk and making risk-aware decisions is widely considered a key step toward ensuring robustness for robots operating under uncertainty. In this paper, we consider the specific case of a nano drone robot learning to navigate an apriori unknown environment while avoiding obstacles under partial observability. We present a distributional reinforcement learning framework in order to learn adaptive risk tendency policies. Specifically, we propose to use tail conditional variance of the learnt action-value distribution as an uncertainty measurement, and use a exponentially weighted average forecasting algorithm to automatically adapt the risk-tendency at run-time based on the observed uncertainty in the environment. We show our algorithm can adjust its risk-sensitivity on the fly both in simulation and real-world experiments and achieving better performance than risk-neutral policy or risk-averse policies. Code and real-world experiment video can be found in this repository: <https://github.com/tudelft/risk-sensitive-rl.git>

READ FULL TEXT

page 1

page 6

research
05/01/2020

Improving Robustness via Risk Averse Distributional Reinforcement Learning

One major obstacle that precludes the success of reinforcement learning ...
research
04/07/2021

Risk-Conditioned Distributional Soft Actor-Critic for Risk-Sensitive Navigation

Modern navigation algorithms based on deep reinforcement learning (RL) s...
research
10/30/2017

How Should a Robot Assess Risk? Towards an Axiomatic Theory of Risk in Robotics

Endowing robots with the capability of assessing risk and making risk-aw...
research
09/16/2021

Enabling risk-aware Reinforcement Learning for medical interventions through uncertainty decomposition

Reinforcement Learning (RL) is emerging as tool for tackling complex con...
research
08/23/2023

SafeAR: Towards Safer Algorithmic Recourse by Risk-Aware Policies

With the growing use of machine learning (ML) models in critical domains...
research
09/02/2020

Adaptive CVaR Optimization for Dynamical Systems with Path Space Stochastic Search

We present a general framework for optimizing the Conditional Value-at-R...
research
08/18/2023

Robust Quadrupedal Locomotion via Risk-Averse Policy Learning

The robustness of legged locomotion is crucial for quadrupedal robots in...

Please sign up or login with your details

Forgot password? Click here to reset