Scalable Multi-Agent Inverse Reinforcement Learning via Actor-Attention-Critic

02/24/2020
by Wonseok Jeon, et al.

Multi-agent adversarial inverse reinforcement learning (MA-AIRL) is a recent approach that applies single-agent AIRL to multi-agent problems, where we seek to recover both policies for our agents and reward functions that promote expert-like behavior. While MA-AIRL has shown promising results on cooperative and competitive tasks, it is sample-inefficient and has only been validated empirically for small numbers of agents; its ability to scale to many agents remains an open question. We propose a multi-agent inverse RL algorithm that is more sample-efficient and scalable than prior work. Specifically, we employ multi-agent actor-attention-critic (MAAC), an off-policy multi-agent RL (MARL) method, for the RL inner loop of the inverse RL procedure. In doing so, we increase sample efficiency compared to state-of-the-art baselines across both small- and large-scale tasks. Moreover, the RL agents trained on the rewards recovered by our method match the experts more closely than those trained on the rewards derived from the baselines. Finally, our method requires far fewer agent-environment interactions, particularly as the number of agents increases.
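The abstract does not give formulas, so the following is only a minimal sketch of the reward-recovery step that adversarial inverse RL methods typically share. It assumes the standard single-agent AIRL discriminator form, D = exp(f) / (exp(f) + pi), applied independently per agent; the per-agent application and all function names (`airl_discriminator`, `airl_reward`, `per_agent_rewards`) are illustrative assumptions, not the authors' implementation. In the proposed method, rewards of this kind would be consumed by an off-policy MARL learner such as MAAC in the inner loop.

```python
import math


def airl_discriminator(f_value, log_pi):
    """Assumed AIRL form: D = exp(f) / (exp(f) + pi).

    Algebraically this equals sigmoid(f - log_pi), which is used here
    because it is numerically stable for large |f|.
    """
    z = f_value - log_pi
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def airl_reward(f_value, log_pi):
    """Reward signal log D - log(1 - D), which simplifies to f - log_pi."""
    return f_value - log_pi


def per_agent_rewards(f_values, log_pis):
    """Hypothetical helper: one reward per agent, given each agent's
    learned f_i(s, a_i) value and log pi_i(a_i | s)."""
    return [airl_reward(f, lp) for f, lp in zip(f_values, log_pis)]


if __name__ == "__main__":
    # Two agents with made-up f values and action probabilities.
    f_values = [0.0, 1.5]
    log_pis = [math.log(0.5), math.log(0.25)]
    print(per_agent_rewards(f_values, log_pis))
```

Because the reward reduces to f - log pi, the discriminator output itself never needs to be exponentiated when generating rewards for the inner-loop learner; it is only needed for the discriminator's classification loss.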

research
10/05/2018

Actor-Attention-Critic for Multi-Agent Reinforcement Learning

Reinforcement learning in multi-agent scenarios is important for real-wo...
research
12/23/2021

Learning Cooperative Multi-Agent Policies with Partial Reward Decoupling

One of the preeminent obstacles to scaling multi-agent reinforcement lea...
research
10/22/2019

Distributed interference cancellation in multi-agent scenarios

This paper considers the problem of detecting impaired and noisy nodes o...
research
05/08/2021

Scalable, Decentralized Multi-Agent Reinforcement Learning Methods Inspired by Stigmergy and Ant Colonies

Bolstering multi-agent learning algorithms to tackle complex coordinatio...
research
03/04/2022

AutoDIME: Automatic Design of Interesting Multi-Agent Environments

Designing a distribution of environments in which RL agents can learn in...
research
12/15/2022

Emergent Behaviors in Multi-Agent Target Acquisition

Only limited studies and superficial evaluations are available on agents...
research
12/09/2019

Adversarial recovery of agent rewards from latent spaces of the limit order book

Inverse reinforcement learning has proved its ability to explain state-a...
