Actor-Attention-Critic for Multi-Agent Reinforcement Learning

10/05/2018
by   Shariq Iqbal, et al.
6

Reinforcement learning in multi-agent scenarios is important for real-world applications but presents challenges beyond those seen in single-agent settings. We present an actor-critic algorithm that trains decentralized policies in multi-agent settings, using centrally computed critics that share an attention mechanism which selects relevant information for each agent at every timestep. This attention mechanism enables more effective and scalable learning in complex multi-agent environments, when compared to recent approaches. Our approach is applicable not only to cooperative settings with shared rewards, but also individualized reward settings, including adversarial settings, and it makes no assumptions about the action spaces of the agents. As such, it is flexible enough to be applied to most multi-agent learning problems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/19/2021

Decentralized Deterministic Multi-Agent Reinforcement Learning

[Zhang, ICML 2018] provided the first decentralized actor-critic algorit...
research
10/06/2020

Heterogeneous Multi-Agent Reinforcement Learning for Unknown Environment Mapping

Reinforcement learning in heterogeneous multi-agent scenarios is importa...
research
03/29/2020

Parallel Knowledge Transfer in Multi-Agent Reinforcement Learning

Multi-agent reinforcement learning is a standard framework for modeling ...
research
02/24/2020

Scalable Multi-Agent Inverse Reinforcement Learning via Actor-Attention-Critic

Multi-agent adversarial inverse reinforcement learning (MA-AIRL) is a re...
research
10/02/2021

AB-Mapper: Attention and BicNet Based Multi-agent Path Finding for Dynamic Crowded Environment

Multi-agent path finding in dynamic crowded environments is of great aca...
research
11/03/2022

Scalable Multi-Agent Reinforcement Learning through Intelligent Information Aggregation

We consider the problem of multi-agent navigation and collision avoidanc...
research
11/13/2018

Modelling the Dynamic Joint Policy of Teammates with Attention Multi-agent DDPG

Modelling and exploiting teammates' policies in cooperative multi-agent ...

Please sign up or login with your details

Forgot password? Click here to reset