Actor-Director-Critic: A Novel Deep Reinforcement Learning Framework

01/10/2023
by   Zongwei Liu, et al.
0

In this paper, we propose actor-director-critic, a new framework for deep reinforcement learning. Compared with the actor-critic framework, the director role is added, and action classification and action evaluation are applied simultaneously to improve the decision-making performance of the agent. Firstly, the actions of the agent are divided into high quality actions and low quality actions according to the rewards returned from the environment. Then, the director network is trained to have the ability to discriminate high and low quality actions and guide the actor network to reduce the repetitive exploration of low quality actions in the early stage of training. In addition, we propose an improved double estimator method to better solve the problem of overestimation in the field of reinforcement learning. For the two critic networks used, we design two target critic networks for each critic network instead of one. In this way, the target value of each critic network can be calculated by taking the average of the outputs of the two target critic networks, which is more stable and accurate than using only one target critic network to obtain the target value. In order to verify the performance of the actor-director-critic framework and the improved double estimator method, we applied them to the TD3 algorithm to improve the TD3 algorithm. Then, we carried out experiments in multiple environments in MuJoCo and compared the experimental data before and after the algorithm improvement. The final experimental results show that the improved algorithm can achieve faster convergence speed and higher total return.

READ FULL TEXT
research
10/24/2022

AACHER: Assorted Actor-Critic Deep Reinforcement Learning with Hindsight Experience Replay

Actor learning and critic learning are two components of the outstanding...
research
11/22/2022

Decision-making with Imaginary Opponent Models

Opponent modeling has benefited a controlled agent's decision-making by ...
research
03/04/2023

Double A3C: Deep Reinforcement Learning on OpenAI Gym Games

Reinforcement Learning (RL) is an area of machine learning figuring out ...
research
12/03/1998

Training Reinforcement Neurocontrollers Using the Polytope Algorithm

A new training algorithm is presented for delayed reinforcement learning...
research
08/23/2022

An intelligent algorithmic trading based on a risk-return reinforcement learning algorithm

This scientific paper propose a novel portfolio optimization model using...
research
06/04/2020

Refined Continuous Control of DDPG Actors via Parametrised Activation

In this paper, we propose enhancing actor-critic reinforcement learning ...
research
10/29/2021

Brick-by-Brick: Combinatorial Construction with Deep Reinforcement Learning

Discovering a solution in a combinatorial space is prevalent in many rea...

Please sign up or login with your details

Forgot password? Click here to reset