Control with Distributed Deep Reinforcement Learning: Learn a Better Policy

11/26/2018
by   Qihao Liu, et al.
0

Distributed approach is a very effective method to improve training efficiency of reinforcement learning. In this paper, we propose a new heuristic distributed architecture for deep reinforcement learning (DRL) algorithm, in which a PSO based network update mechanism is adopted to speed up learning an optimal policy besides using multiple agents for parallel training. In this mechanism, the update of neural network of each agent is not only according to the training result of itself, but also affected by the optimal neural network of all agents. In order to verify the effectiveness of the proposed method, the proposed architecture is implemented on the Deep Q-Network algorithm (DQN) and the Deep Deterministic Policy Gradient algorithm (DDPG) to train several typical control problems. The training results show that the proposed method is effective.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/15/2015

Massively Parallel Methods for Deep Reinforcement Learning

We present the first massively distributed architecture for deep reinfor...
research
06/17/2021

A Deep Reinforcement Learning Approach towards Pendulum Swing-up Problem based on TF-Agents

Adapting the idea of training CartPole with Deep Q-learning agent, we ar...
research
11/10/2017

Towards the Use of Deep Reinforcement Learning with Global Policy For Query-based Extractive Summarisation

Supervised approaches for text summarisation suffer from the problem of ...
research
08/21/2020

Biomechanic Posture Stabilisation via Iterative Training of Multi-policy Deep Reinforcement Learning Agents

It is not until we become senior citizens do we recognise how much we to...
research
04/03/2019

Random Projection in Neural Episodic Control

End-to-end deep reinforcement learning has enabled agents to learn with ...
research
06/04/2020

A Novel Update Mechanism for Q-Networks Based On Extreme Learning Machines

Reinforcement learning is a popular machine learning paradigm which can ...

Please sign up or login with your details

Forgot password? Click here to reset