On the Emergence of Cooperation in the Repeated Prisoner's Dilemma

11/24/2022
by   Maximilian Schaefer, et al.
0

Using simulations between pairs of ϵ-greedy q-learners with one-period memory, this article demonstrates that the potential function of the stochastic replicator dynamics (Foster and Young, 1990) allows it to predict the emergence of error-proof cooperative strategies from the underlying parameters of the repeated prisoner's dilemma. The observed cooperation rates between q-learners are related to the ratio between the kinetic energy exerted by the polar attractors of the replicator dynamics under the grim trigger strategy. The frontier separating the parameter space conducive to cooperation from the parameter space dominated by defection can be found by setting the kinetic energy ratio equal to a critical value, which is a function of the discount factor, f(δ) = δ/(1-δ), multiplied by a correction term to account for the effect of the algorithms' exploration probability. The gradient at the frontier increases with the distance between the game parameters and the hyperplane that characterizes the incentive compatibility constraint for cooperation under grim trigger. Building on literature from the neurosciences, which suggests that reinforcement learning is useful to understanding human behavior in risky environments, the article further explores the extent to which the frontier derived for q-learners also explains the emergence of cooperation between humans. Using metadata from laboratory experiments that analyze human choices in the infinitely repeated prisoner's dilemma, the cooperation rates between humans are compared to those observed between q-learners under similar conditions. The correlation coefficients between the cooperation rates observed for humans and those observed for q-learners are consistently above 0.8. The frontier derived from the simulations between q-learners is also found to predict the emergence of cooperation between humans.

READ FULL TEXT

page 15

page 17

research
03/27/2018

Emergence of Cooperation in the thermodynamic limit

Predicting how cooperative behavior arises in the thermodynamic limit is...
research
09/01/2022

Intrinsic fluctuations of reinforcement learning promote cooperation

In this work, we ask for and answer what makes classical reinforcement l...
research
09/19/2016

The Optional Prisoner's Dilemma in a Spatial Environment: Coevolving Game Strategy and Link Weights

In this paper, the Optional Prisoner's Dilemma game in a spatial environ...
research
09/30/2013

Signed Networks, Triadic Interactions and the Evolution of Cooperation

We outline a model to study the evolution of cooperation in a population...
research
06/11/2018

Adaptive Mechanism Design: Learning to Promote Cooperation

In the future, artificial learning agents are likely to become increasin...
research
06/03/2023

The conflict between self-interaction and updating passivity in the evolution of cooperation

In social dilemmas under weak selection, the capacity for a player to ex...
research
09/04/2020

Imitation of Success Leads to Cost of Living Mediated Fairness in the Ultimatum Game

The mechanism behind the emergence of cooperation in both biological and...

Please sign up or login with your details

Forgot password? Click here to reset