Quasi-Equivalence Discovery for Zero-Shot Emergent Communication

03/14/2021
by   Kalesha Bullard, et al.
9

Effective communication is an important skill for enabling information exchange in multi-agent settings and emergent communication is now a vibrant field of research, with common settings involving discrete cheap-talk channels. Since, by definition, these settings involve arbitrary encoding of information, typically they do not allow for the learned protocols to generalize beyond training partners. In contrast, in this work, we present a novel problem setting and the Quasi-Equivalence Discovery (QED) algorithm that allows for zero-shot coordination (ZSC), i.e., discovering protocols that can generalize to independently trained agents. Real world problem settings often contain costly communication channels, e.g., robots have to physically move their limbs, and a non-uniform distribution over intents. We show that these two factors lead to unique optimal ZSC policies in referential games, where agents use the energy cost of the messages to communicate intent. Other-Play was recently introduced for learning optimal ZSC policies, but requires prior access to the symmetries of the problem. Instead, QED can iteratively discovers the symmetries in this setting and converges to the optimal ZSC policy.

READ FULL TEXT
POST COMMENT

Comments

There are no comments yet.

Authors

page 2

page 7

page 8

page 12

page 13

page 14

10/29/2020

Exploring Zero-Shot Emergent Communication in Embodied Multi-Agent Populations

Effective communication is an important skill for enabling information e...
03/06/2021

Off-Belief Learning

The standard problem setting in Dec-POMDPs is self-play, where the goal ...
06/11/2021

A New Formalism, Method and Open Issues for Zero-Shot Coordination

In many coordination problems, independently reasoning humans are able t...
01/28/2022

Any-Play: An Intrinsic Augmentation for Zero-Shot Coordination

Cooperative artificial intelligence with human or superhuman proficiency...
05/11/2021

Zero-Shot Generalization using Intrinsically Motivated Compositional Emergent Protocols

Human language has been described as a system that makes use of finite m...
02/07/2014

Frequency-Based Patrolling with Heterogeneous Agents and Limited Communication

This paper investigates multi-agent frequencybased patrolling of interse...
07/17/2021

Implicit Communication as Minimum Entropy Coupling

In many common-payoff games, achieving good performance requires players...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.