Learning multiagent coordination in the absence of communication channels

02/16/2018
by Aaron Goodman, et al.

In this work, we develop a reinforcement learning protocol for a multiagent coordination task in a discrete state and action space: an iterated prisoner's dilemma game extended into a team-based, winner-take-all tournament, which forces the agents to collude in order to maximize their reward. Because extra communication channels are disallowed, the agents must embed their coordination strategy in their actions in the prisoner's dilemma game. We develop a representation of the iterated prisoner's dilemma that makes it amenable to Q-learning. We find that the reinforcement learning strategy consistently trains agents that can win the winner-take-all iterated prisoner's dilemma tournament. Using a game with a discrete state and action space lets us better analyze and understand both the dynamics and the communication protocols established between the agents. We find that the agents adopt a number of interesting behaviors, such as the formation of benevolent dictators, that minimize inequality of scores. We also find that the agents settle on a remarkably consistent symbology in their actions, such that agents from independent trials are able to collude with each other without further training.
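To make the Q-learning setting concrete, here is a minimal sketch of tabular Q-learning on an ordinary two-player iterated prisoner's dilemma. It is an illustration under stated assumptions, not the paper's method: the abstract does not specify the state representation, the payoff values, or the team tournament rules, so the payoffs (5, 3, 1, 0), the state encoding (the previous joint action), the fixed tit-for-tat opponent, and every function name below are hypothetical.

```python
import random

# Actions: cooperate or defect.
C, D = 0, 1

# Assumed textbook payoffs (not taken from the source):
# (my action, opponent action) -> (my reward, opponent reward).
PAYOFF = {(C, C): (3, 3), (C, D): (0, 5), (D, C): (5, 0), (D, D): (1, 1)}


def q_update(q, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """One tabular Q-learning step:
    Q(s, a) += alpha * (r + gamma * max_a' Q(s', a') - Q(s, a))."""
    best_next = max(q[next_state])
    q[state][action] += alpha * (reward + gamma * best_next - q[state][action])


def train_vs_tit_for_tat(rounds=5000, epsilon=0.1, seed=0):
    """Train one epsilon-greedy learner against a fixed tit-for-tat opponent."""
    rng = random.Random(seed)
    # Hypothetical state encoding: (own previous action, opponent previous action).
    q = {(a, b): [0.0, 0.0] for a in (C, D) for b in (C, D)}
    state = (C, C)  # assume both players cooperated before the first round
    for _ in range(rounds):
        if rng.random() < epsilon:
            action = rng.choice((C, D))  # explore
        else:
            action = max((C, D), key=lambda a: q[state][a])  # exploit
        opp_action = state[0]  # tit-for-tat repeats the learner's previous move
        reward, _ = PAYOFF[(action, opp_action)]
        next_state = (action, opp_action)
        q_update(q, state, action, reward, next_state)
        state = next_state
    return q
```

Calling `train_vs_tit_for_tat()` returns a table with four states and two action values per state. The team-based, winner-take-all setting described above would additionally need several learners and a tournament-level reward, which this sketch does not attempt.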


