Sample Efficient Training in Multi-Agent Adversarial Games with Limited Teammate Communication

11/01/2020
by   Hardik Meisheri, et al.
0

We describe our solution approach for Pommerman TeamRadio, a competition environment associated with NeurIPS 2019. The defining feature of our algorithm is achieving sample efficiency within a restrictive computational budget while beating the previous years learning agents. The proposed algorithm (i) uses imitation learning to seed the policy, (ii) explicitly defines the communication protocol between the two teammates, (iii) shapes the reward to provide a richer feedback signal to each agent during training and (iv) uses masking for catastrophic bad actions. We describe extensive tests against baselines, including those from the 2019 competition leaderboard, and also a specific investigation of the learned policy and the effect of each modification on performance. We show that the proposed approach is able to achieve competitive performance within half a million games of training, significantly faster than other studies in the literature.

READ FULL TEXT

page 2

page 8

page 10

research
11/12/2019

Accelerating Training in Pommerman with Imitation and Reinforcement Learning

The Pommerman simulation was recently developed to mimic the classic Jap...
research
08/20/2023

Mimicking To Dominate: Imitation Learning Strategies for Success in Multiagent Competitive Games

Training agents in multi-agent competitive games presents significant ch...
research
10/30/2022

Imitating Opponent to Win: Adversarial Policy Imitation Learning in Two-player Competitive Games

Recent research on vulnerabilities of deep reinforcement learning (RL) h...
research
08/21/2021

MimicBot: Combining Imitation and Reinforcement Learning to win in Bot Bowl

This paper describe an hybrid agent trained to play in Fantasy Football ...
research
01/05/2022

Conditional Imitation Learning for Multi-Agent Games

While advances in multi-agent learning have enabled the training of incr...
research
11/27/2020

TStarBot-X: An Open-Sourced and Comprehensive Study for Efficient League Training in StarCraft II Full Game

StarCraft, one of the most difficult esport games with long-standing his...
research
09/08/2016

Ms. Pac-Man Versus Ghost Team CIG 2016 Competition

This paper introduces the revival of the popular Ms. Pac-Man Versus Ghos...

Please sign up or login with your details

Forgot password? Click here to reset