Dynamic Interval Restrictions on Action Spaces in Deep Reinforcement Learning for Obstacle Avoidance

06/13/2023
by   Tim Grams, et al.
0

Deep reinforcement learning algorithms typically act on the same set of actions. However, this is not sufficient for a wide range of real-world applications where different subsets are available at each step. In this thesis, we consider the problem of interval restrictions as they occur in pathfinding with dynamic obstacles. When actions that lead to collisions are avoided, the continuous action space is split into variable parts. Recent research learns with strong assumptions on the number of intervals, is limited to convex subsets, and the available actions are learned from the observations. Therefore, we propose two approaches that are independent of the state of the environment by extending parameterized reinforcement learning and ConstraintNet to handle an arbitrary number of intervals. We demonstrate their performance in an obstacle avoidance task and compare the methods to penalties, projection, replacement, as well as discrete and continuous masking from the literature. The results suggest that discrete masking of action-values is the only effective method when constraints did not emerge during training. When restrictions are learned, the decision between projection, masking, and our ConstraintNet modification seems to depend on the task at hand. We compare the results with varying complexity and give directions for future work.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/13/2015

Deep Reinforcement Learning in Parameterized Action Space

Recent work has shown that deep neural networks are capable of approxima...
research
03/08/2021

Vision-Based Mobile Robotics Obstacle Avoidance With Deep Reinforcement Learning

Obstacle avoidance is a fundamental and challenging problem for autonomo...
research
05/07/2021

Utilizing Skipped Frames in Action Repeats via Pseudo-Actions

In many deep reinforcement learning settings, when an agent takes an act...
research
12/24/2015

Deep Reinforcement Learning in Large Discrete Action Spaces

Being able to reason in an environment with a large number of discrete a...
research
12/23/2021

Missing Velocity in Dynamic Obstacle Avoidance based on Deep Reinforcement Learning

We introduce a novel approach to dynamic obstacle avoidance based on Dee...
research
03/08/2023

Using Memory-Based Learning to Solve Tasks with State-Action Constraints

Tasks where the set of possible actions depend discontinuously on the st...

Please sign up or login with your details

Forgot password? Click here to reset