Incentivizing the Emergence of Grounded Discrete Communication Between General Agents

01/06/2020
by   Thomas A. Unger, et al.
7

We converted the recently developed BabyAI grid world platform to a sender/receiver setup in order to test the hypothesis that established deep reinforcement learning techniques are sufficient to incentivize the emergence of a grounded discrete communication protocol between general agents. This is in contrast to previous experiments that employed straight-through estimation or tailored inductive biases. Our results show that these can indeed be avoided, by instead providing proper environmental incentives. Moreover, they show that a longer interval between communications incentivized more abstract semantics. In some cases, the communicating agents adapted to new environments more quickly than monolithic agents, showcasing the potential of emergent discrete communication for transfer learning.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

page 6

page 7

page 9

research
04/11/2018

Emergent Communication through Negotiation

Multi-agent reinforcement learning offers a way to study how communicati...
research
06/06/2023

Inductive Bias for Emergent Communication in a Continuous Setting

We study emergent communication in a multi-agent reinforcement learning ...
research
12/11/2019

Biases for Emergent Communication in Multi-agent Reinforcement Learning

We study the problem of emergent communication, in which language arises...
research
05/10/2023

Context-dependent communication under environmental constraints

There is significant evidence that real-world communication cannot be re...
research
10/04/2019

Developmentally motivated emergence of compositional communication via template transfer

This paper explores a novel approach to achieving emergent compositional...
research
01/30/2023

Communication Drives the Emergence of Language Universals in Neural Agents: Evidence from the Word-order/Case-marking Trade-off

Artificial learners often behave differently from human learners in the ...
research
07/22/2023

Emergence of Adaptive Circadian Rhythms in Deep Reinforcement Learning

Adapting to regularities of the environment is critical for biological o...

Please sign up or login with your details

Forgot password? Click here to reset