Learning Social Conventions in Markov Games

06/26/2018
by   Adam Lerer, et al.
0

Social conventions - arbitrary ways to organize group behavior - are an important part of social life. Any agent that wants to enter an existing society must be able to learn its conventions (e.g. which side of the road to drive on, which language to speak) from relatively few observations or risk being unable to coordinate with everyone else. We consider the game theoretic framework of David Lewis which views the selection of a social convention as the selection of an equilibrium in a coordination game. We ask how to construct reinforcement learning based agents that can solve the convention learning task in the self-play paradigm: at training time the agent has access to a good model of the environment and a small amount of observations about how individuals in society act. The agent then has to construct a policy that is compatible with the test-time social convention. We study three environments from the literature which have multiple conventions: traffic, communication, and risky coordination. In each of these we observe that adding a small amount of imitation learning during self-play training greatly increases the probability that the strategy found by self-play fits well with the social convention the agent will face at test time. We show that this works even in an environment where standard independent multi-agent RL very rarely finds the correct test-time equilibrium.

READ FULL TEXT
research
06/26/2018

Learning Existing Social Conventions in Markov Games

In order for artificial agents to coordinate effectively with people, th...
research
09/04/2019

No Press Diplomacy: Modeling Multi-Agent Gameplay

Diplomacy is a seven-player non-stochastic, non-cooperative game, where ...
research
05/31/2023

Adaptive Coordination in Social Embodied Rearrangement

We present the task of "Social Rearrangement", consisting of cooperative...
research
01/05/2022

Conditional Imitation Learning for Multi-Agent Games

While advances in multi-agent learning have enabled the training of incr...
research
07/10/2021

Multi-Agent Imitation Learning with Copulas

Multi-agent imitation learning aims to train multiple agents to perform ...
research
09/20/2021

Two Approaches to Building Collaborative, Task-Oriented Dialog Agents through Self-Play

Task-oriented dialog systems are often trained on human/human dialogs, s...
research
12/11/2014

Reinforcement Learning and Nonparametric Detection of Game-Theoretic Equilibrium Play in Social Networks

This paper studies two important signal processing aspects of equilibriu...

Please sign up or login with your details

Forgot password? Click here to reset