Path Planning Using Wassertein Distributionally Robust Deep Q-learning

11/04/2022
by   Cem Alpturk, et al.
0

We investigate the problem of risk averse robot path planning using the deep reinforcement learning and distributionally robust optimization perspectives. Our problem formulation involves modelling the robot as a stochastic linear dynamical system, assuming that a collection of process noise samples is available. We cast the risk averse motion planning problem as a Markov decision process and propose a continuous reward function design that explicitly takes into account the risk of collision with obstacles while encouraging the robot's motion towards the goal. We learn the risk-averse robot control actions through Lipschitz approximated Wasserstein distributionally robust deep Q-learning to hedge against the noise uncertainty. The learned control actions result in a safe and risk averse trajectory from the source to the goal, avoiding all the obstacles. Various supporting numerical simulations are presented to demonstrate our proposed approach.

READ FULL TEXT
research
09/16/2023

Distributionally Robust CVaR-Based Safety Filtering for Motion Planning in Uncertain Environments

Safety is a core challenge of autonomous robot motion planning, especial...
research
10/07/2017

Modelling and Path Planning of Snake Robot in cluttered environment

Studying snake robot locomotion in a cluttered environment has been a co...
research
04/16/2019

Explicit Motion Risk Representation

This paper presents a formal definition and explicit representation of r...
research
11/11/2020

Decentralized Motion Planning for Multi-Robot Navigation using Deep Reinforcement Learning

This work presents a decentralized motion planning framework for address...
research
09/25/2018

Path Planning and Controlled Crash Landing of a Quadcopter in case of a Rotor Failure

This paper presents a framework for controlled emergency landing of a qu...
research
02/25/2020

Safe Optimal Control under Parametric Uncertainties

We address the issue of safe optimal path planning under parametric unce...
research
09/13/2019

Deep Learned Path Planning via Randomized Reward-Linked-Goals and Potential Space Applications

Space exploration missions have seen use of increasingly sophisticated r...

Please sign up or login with your details

Forgot password? Click here to reset