Multi-agent Reinforcement Learning in Sequential Social Dilemmas

02/10/2017
by   Joel Z. Leibo, et al.
0

Matrix games like Prisoner's Dilemma have guided research on social dilemmas for decades. However, they necessarily treat the choice to cooperate or defect as an atomic action. In real-world social dilemmas these choices are temporally extended. Cooperativeness is a property that applies to policies, not elementary actions. We introduce sequential social dilemmas that share the mixed incentive structure of matrix game social dilemmas but also require agents to learn policies that implement their strategic intentions. We analyze the dynamics of policies learned by multiple self-interested independent learning agents, each using its own deep Q-network, on two Markov games we introduce here: 1. a fruit Gathering game and 2. a Wolfpack hunting game. We characterize how learned behavior in each domain changes as a function of environmental factors including resource abundance. Our experiments show how conflict can emerge from competition over shared resources and shed light on how the sequential nature of real world social dilemmas affects cooperation.

READ FULL TEXT

page 3

page 4

page 6

page 7

research
03/01/2018

Towards Cooperation in Sequential Prisoner's Dilemmas: a Deep Multiagent Reinforcement Learning Approach

The Iterated Prisoner's Dilemma has guided research on social dilemmas f...
research
06/09/2020

Online Learning in Iterated Prisoner's Dilemma to Mimic Human Behavior

Prisoner's Dilemma mainly treat the choice to cooperate or defect as an ...
research
03/23/2018

Inequity aversion resolves intertemporal social dilemmas

Groups of humans are often able to find ways to cooperate with one anoth...
research
03/23/2018

Inequity aversion improves cooperation in intertemporal social dilemmas

Groups of humans are often able to find ways to cooperate with one anoth...
research
03/19/2019

Learning Reciprocity in Complex Sequential Social Dilemmas

Reciprocity is an important feature of human social interaction and unde...
research
02/28/2023

IQ-Flow: Mechanism Design for Inducing Cooperative Behavior to Self-Interested Agents in Sequential Social Dilemmas

Achieving and maintaining cooperation between agents to accomplish a com...
research
07/15/2022

Stochastic Market Games

Some of the most relevant future applications of multi-agent systems lik...

Please sign up or login with your details

Forgot password? Click here to reset