Hierarchical Critics Assignment for Multi-agent Reinforcement Learning

02/08/2019
by   Zehong Cao, et al.
0

In this paper, we investigate the use of global information to speed up the learning process and increase the cumulative rewards of multi-agent reinforcement learning (MARL) tasks. Within the actor-critic MARL, we introduce multiple cooperative critics from two levels of the hierarchy and propose a hierarchical critic-based multi-agent reinforcement learning algorithm. In our approach, the agent is allowed to receive information from local and global critics in a competition task. The agent not only receives low-level details but also consider coordination from high levels that receiving global information to increase operation skills. Here, we define multiple cooperative critics in the top-bottom hierarchy, called the Hierarchical Critics Assignment (HCA) framework. Our experiment, a two-player tennis competition task in the Unity environment, tested HCA multi-agent framework based on Asynchronous Advantage Actor-Critic (A3C) with Proximal Policy Optimization (PPO) algorithm. The results showed that the HCA- framework outperforms the non-hierarchical critics baseline method for MARL tasks.

READ FULL TEXT
research
10/01/2017

Parameter Sharing Deep Deterministic Policy Gradient for Cooperative Multi-agent Reinforcement Learning

Deep reinforcement learning for multi-agent cooperation and competition ...
research
02/15/2021

Intelligent Electric Vehicle Charging Recommendation Based on Multi-Agent Reinforcement Learning

Electric Vehicle (EV) has become a preferable choice in the modern trans...
research
05/11/2022

Efficient Distributed Framework for Collaborative Multi-Agent Reinforcement Learning

Multi-agent reinforcement learning for incomplete information environmen...
research
02/25/2023

Hierarchical Needs-driven Agent Learning Systems: From Deep Reinforcement Learning To Diverse Strategies

The needs describe the necessities for a system to survive and evolve, w...
research
12/09/2019

Intelligent Coordination among Multiple Traffic Intersections Using Multi-Agent Reinforcement Learning

We use Asynchronous Advantage Actor Critic (A3C) for implementing an AI ...
research
09/27/2019

Multi-Agent Actor-Critic with Hierarchical Graph Attention Network

Most previous studies on multi-agent reinforcement learning focus on der...
research
06/08/2022

Stabilizing Voltage in Power Distribution Networks via Multi-Agent Reinforcement Learning with Transformer

The increased integration of renewable energy poses a slew of technical ...

Please sign up or login with your details

Forgot password? Click here to reset