Particle-Based Adaptive Discretization for Continuous Control using Deep Reinforcement Learning

03/16/2020
by Pei Xu, et al.

Learning control in high-dimensional continuous action spaces, such as controlling the movements of highly articulated agents and robots, has long been a challenge for model-free deep reinforcement learning (DRL). In this paper, we propose a general yet simple framework for improving the action exploration of policy gradient DRL algorithms. Our approach adapts ideas from the particle filtering literature to dynamically discretize the continuous action space and track policies represented as a mixture of Gaussians. We demonstrate the applicability of our approach on state-of-the-art DRL baselines in challenging high-dimensional motor tasks involving articulated agents. We show that our adaptive particle-based discretization improves both final performance and speed of convergence compared with uniform discretization schemes and with corresponding implementations in continuous action spaces, highlighting the importance of exploration. In addition, the resulting policies are more stable, exhibiting less variance across training trials.
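To make the idea of a particle-based discretization concrete, here is a minimal sketch for a single action dimension. It is not the authors' implementation: the function names (`init_particles`, `sample_action`, `resample_dead_particles`), the weight threshold, and the rule of copying the highest-weight particle are all assumptions made for illustration. Each particle is a Gaussian with its own mean and standard deviation, and the policy is assumed to output one logit per particle, so the action distribution is a mixture of Gaussians.

```python
import math
import random


def init_particles(k, low, high):
    """Start from a uniform discretization: k Gaussians evenly spaced on [low, high]."""
    width = (high - low) / k
    means = [low + width * (i + 0.5) for i in range(k)]
    stds = [width] * k  # assumed initial spread, one bin width per particle
    return means, stds


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sample_action(logits, means, stds, rng):
    """Sample from the mixture: pick a particle by weight, then draw from its Gaussian."""
    weights = softmax(logits)
    k = rng.choices(range(len(weights)), weights=weights, k=1)[0]
    return rng.gauss(means[k], stds[k]), k


def resample_dead_particles(weights, means, stds, threshold=0.01):
    """Hypothetical resampling rule: a particle whose weight falls below
    `threshold` is moved onto the current highest-weight particle, and the
    two share their combined weight equally. Total weight is preserved."""
    weights, means, stds = list(weights), list(means), list(stds)
    for i in range(len(weights)):
        if weights[i] >= threshold:
            continue
        j = max(range(len(weights)), key=lambda n: weights[n])
        if j == i:
            continue  # nothing better to copy from
        means[i], stds[i] = means[j], stds[j]
        shared = (weights[i] + weights[j]) / 2.0
        weights[i] = weights[j] = shared
    return weights, means, stds
```

In this sketch, resampling concentrates particles where the policy already places probability mass, which is the sense in which the discretization adapts during training. In a real policy gradient setup the means and standard deviations would also be trainable parameters, so the duplicated particles could drift apart and refine the discretization locally.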


research
01/31/2017

Deep Reinforcement Learning for Robotic Manipulation-The state of the art

The focus of this work is to enumerate the various approaches and algori...
research
10/17/2019

Adaptive Discretization for Episodic Reinforcement Learning in Metric Spaces

We present an efficient algorithm for model-free episodic reinforcement ...
research
11/29/2022

Continuous Neural Algorithmic Planners

Neural algorithmic reasoning studies the problem of learning algorithms ...
research
09/29/2022

Hierarchical Training of Deep Ensemble Policies for Reinforcement Learning in Continuous Spaces

Many actor-critic deep reinforcement learning (DRL) algorithms have achi...
research
03/14/2019

Deep Reinforcement Learning with Feedback-based Exploration

Deep Reinforcement Learning has enabled the control of increasingly comp...
research
01/06/2023

Centralized Cooperative Exploration Policy for Continuous Control Tasks

The deep reinforcement learning (DRL) algorithm works brilliantly on sol...
