Tackling Asymmetric and Circular Sequential Social Dilemmas with Reinforcement Learning and Graph-based Tit-for-Tat

06/26/2022
by   Tangui Le Gléau, et al.
0

In many societal and industrial interactions, participants generally prefer their pure self-interest at the expense of the global welfare. Known as social dilemmas, this category of non-cooperative games offers situations where multiple actors should all cooperate to achieve the best outcome but greed and fear lead to a worst self-interested issue. Recently, the emergence of Deep Reinforcement Learning (RL) has generated revived interest in social dilemmas with the introduction of Sequential Social Dilemma (SSD). Cooperative agents mixing RL policies and Tit-for-tat (TFT) strategies have successfully addressed some non-optimal Nash equilibrium issues. However, this kind of paradigm requires symmetrical and direct cooperation between actors, conditions that are not met when mutual cooperation become asymmetric and is possible only with at least a third actor in a circular way. To tackle this issue, this paper extends SSD with Circular Sequential Social Dilemma (CSSD), a new kind of Markov games that better generalizes the diversity of cooperation between agents. Secondly, to address such circular and asymmetric cooperation, we propose a candidate solution based on RL policies and a graph-based TFT. We conducted some experiments on a simple multi-player grid world which offers adaptable cooperation structures. Our work confirmed that our graph-based approach is beneficial to address circular situations by encouraging self-interested agents to reach mutual cooperation.

READ FULL TEXT

page 7

page 12

research
07/04/2017

Maintaining cooperation in complex social dilemmas using deep reinforcement learning

Social dilemmas are situations where individuals face a temptation to in...
research
03/01/2018

Towards Cooperation in Sequential Prisoner's Dilemmas: a Deep Multiagent Reinforcement Learning Approach

The Iterated Prisoner's Dilemma has guided research on social dilemmas f...
research
01/15/2020

Inducing Cooperation in Multi-Agent Games Through Status-Quo Loss

Social dilemma situations bring out the conflict between individual and ...
research
03/15/2023

Coordinating Fully-Cooperative Agents Using Hierarchical Learning Anticipation

Learning anticipation is a reasoning paradigm in multi-agent reinforceme...
research
10/19/2017

Consequentialist conditional cooperation in social dilemmas with imperfect information

Social dilemmas, where mutual cooperation can lead to high payoffs but p...
research
02/28/2023

IQ-Flow: Mechanism Design for Inducing Cooperative Behavior to Self-Interested Agents in Sequential Social Dilemmas

Achieving and maintaining cooperation between agents to accomplish a com...
research
10/25/2020

Emergence and Stability of Self-Evolved Cooperative Strategies using Stochastic Machines

To investigate the origin of cooperative behaviors, we developed an evol...

Please sign up or login with your details

Forgot password? Click here to reset