Learning to Communicate to Solve Riddles with Deep Distributed Recurrent Q-Networks

02/08/2016
by   Jakob N. Foerster, et al.
0

We propose deep distributed recurrent Q-networks (DDRQN), which enable teams of agents to learn to solve communication-based coordination tasks. In these tasks, the agents are not given any pre-designed communication protocol. Therefore, in order to successfully communicate, they must first automatically develop and agree upon their own communication protocol. We present empirical results on two multi-agent learning problems based on well-known riddles, demonstrating that DDRQN can successfully solve such tasks and discover elegant communication protocols to do so. To our knowledge, this is the first time deep reinforcement learning has succeeded in learning communication protocols. In addition, we present ablation experiments that confirm that each of the main components of the DDRQN architecture are critical to its success.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/16/2019

Learning Efficient Multi-agent Communication: An Information Bottleneck Approach

Many real-world multi-agent reinforcement learning applications require ...
research
11/02/2022

Over-communicate no more: Situated RL agents learn concise communication protocols

While it is known that communication facilitates cooperation in multi-ag...
research
01/24/2019

Decentralization of Multiagent Policies by Learning What to Communicate

Effective communication is required for teams of robots to solve sophist...
research
10/21/2019

Learning to Communicate in a Noisy Environment

In this work we examine the problem of learning to cooperate in the cont...
research
08/24/2018

A Communication Protocol for Man-Machine Networks

One of the most challenging coordination problems in artificial intellig...
research
04/16/2012

Efficient Protocols for Distributed Classification and Optimization

In distributed learning, the goal is to perform a learning task over dat...
research
12/14/2021

Learning to Guide and to Be Guided in the Architect-Builder Problem

We are interested in interactive agents that learn to coordinate, namely...

Please sign up or login with your details

Forgot password? Click here to reset