PIC: Permutation Invariant Critic for Multi-Agent Deep Reinforcement Learning

10/31/2019
by   Iou-Jen Liu, et al.

Sample efficiency and scalability to a large number of agents are two important goals for multi-agent reinforcement learning systems. Recent work has moved closer to those goals by addressing the non-stationarity of the environment from a single agent's perspective with a deep net critic that depends on all observations and actions. The critic input concatenates agent observations and actions in a user-specified order. However, since deep nets aren't permutation invariant, a permuted input changes the critic output even though the environment remains identical. To avoid this inefficiency, we propose a 'permutation invariant critic' (PIC), which yields identical output irrespective of the agent permutation. This consistent representation enables our model to scale to 30 times more agents and to achieve improvements of test episode reward between 15% and 50% on the challenging multi-agent particle environment (MPE).
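The difference between an order-dependent critic and a permutation invariant one can be illustrated with a small sketch. The abstract does not say how PIC achieves invariance, so the code below is not the authors' architecture: it assumes a shared per-agent encoder followed by mean pooling as one simple stand-in. All function names, layer sizes, and the random weights are hypothetical and chosen only for illustration.

```python
import math
import random


def make_layer(n_in, n_out, rng):
    """Create a random dense layer as (weights, biases). Illustrative only."""
    w = [[rng.uniform(-1, 1) for _ in range(n_in)] for _ in range(n_out)]
    b = [rng.uniform(-1, 1) for _ in range(n_out)]
    return w, b


def dense(layer, x, activate=True):
    """Apply a dense layer, optionally followed by tanh."""
    w, b = layer
    out = [sum(wi * xi for wi, xi in zip(row, x)) + bi for row, bi in zip(w, b)]
    return [math.tanh(v) for v in out] if activate else out


def concat_critic(layers, agent_inputs):
    """Baseline critic: concatenate per-agent (observation, action) vectors
    in the order given, then apply an MLP. The output depends on that order."""
    x = [v for agent in agent_inputs for v in agent]
    hidden = dense(layers[0], x)
    return dense(layers[1], hidden, activate=False)[0]


def pooled_critic(layers, agent_inputs):
    """Assumed stand-in for a permutation invariant critic: encode every agent
    with the same weights, mean-pool the encodings, then read out a value."""
    encoded = [dense(layers[0], agent) for agent in agent_inputs]
    pooled = [sum(col) / len(encoded) for col in zip(*encoded)]
    return dense(layers[1], pooled, activate=False)[0]


rng = random.Random(0)
n_agents, feat, hidden_size = 3, 4, 8

# One vector per agent, standing in for its concatenated observation and action.
agents = [[rng.uniform(-1, 1) for _ in range(feat)] for _ in range(n_agents)]
permuted = [agents[2], agents[0], agents[1]]  # same environment, different order

concat_layers = [make_layer(n_agents * feat, hidden_size, rng),
                 make_layer(hidden_size, 1, rng)]
pooled_layers = [make_layer(feat, hidden_size, rng),
                 make_layer(hidden_size, 1, rng)]

q_concat = concat_critic(concat_layers, agents)
q_concat_perm = concat_critic(concat_layers, permuted)
q_pooled = pooled_critic(pooled_layers, agents)
q_pooled_perm = pooled_critic(pooled_layers, permuted)

print("concatenation critic:", q_concat, q_concat_perm)
print("pooled critic:       ", q_pooled, q_pooled_perm)
```

With the concatenation critic the two printed values generally differ, while the pooled critic returns the same value up to floating-point rounding. This is why the order-dependent critic has to learn each of the orderings of the same state separately.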

