Balancing Cooperativeness and Adaptiveness in the (Noisy) Iterated Prisoner's Dilemma

03/06/2023
by   Adrian Hutter, et al.
0

Ever since Axelrod's seminal work, tournaments served as the main benchmark for evaluating strategies in the Iterated Prisoner's Dilemma (IPD). In this work, we first introduce a strategy for the IPD which outperforms previous tournament champions when evaluated against the 239 strategies in the Axelrod library, at noise levels in the IPD ranging from 0 behind our strategy is to start playing a version of tit-for-tat which forgives unprovoked defections if their rate is not significantly above the noise level, while building a (memory-1) model of the opponent; then switch to a strategy which is optimally adapted to the model of the opponent. We then argue that the above strategy (like other prominent strategies) lacks a couple of desirable properties which are not well tested for by tournaments, but which will be relevant in other contexts: we want our strategy to be self-cooperating, i.e., cooperate with a clone with high probability, even at high noise levels; and we want it to be cooperation-inducing, i.e., optimal play against it should entail cooperating with high probability. We show that we can guarantee these properties, at a modest cost in tournament performance, by reverting from the strategy adapted to the opponent to the forgiving tit-for-tat strategy under suitable conditions

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/17/2022

High Probability Bounds for Stochastic Subgradient Schemes with Heavy Tailed Noise

In this work we study high probability bounds for stochastic subgradient...
research
04/01/2019

Recognising and evaluating the effectiveness of extortion in the Iterated Prisoner's Dilemma

Since the introduction of zero-determinant strategies, extortionate stra...
research
12/18/2017

Invincible Strategies of Iterated Prisoner's Dilemma

Iterated Prisoner's Dilemma(IPD) is a well-known benchmark for studying ...
research
05/10/2018

On Randomised Strategies in the λ-Calculus (Long Version)

In this work we introduce randomised reduction strategies, a notion alre...
research
03/21/2021

Effects of Dynamic-Win-Stay-Lose-Learn model with voluntary participation in social dilemma

In recent years, Win-Stay-Lose-Learn rule has attracted wide attention a...
research
03/16/2019

On-Line Balancing of Random Inputs

We consider an online vector balancing game where vectors v_t, chosen un...
research
01/05/2023

Evaluating Evasion Strategies in Zebrafish Larvae

An effective evasion strategy allows prey to survive encounters with pre...

Please sign up or login with your details

Forgot password? Click here to reset