Symmetric equilibrium of multi-agent reinforcement learning in repeated prisoner's dilemma

01/28/2021
by   Yuki Usui, et al.
0

We investigate the repeated prisoner's dilemma game where both players alternately use reinforcement learning to obtain their optimal memory-one strategies. We theoretically solve the simultaneous Bellman optimality equations of reinforcement learning. We find that the Win-stay Lose-shift strategy, the Grim strategy, and the strategy which always defects can form symmetric equilibrium of the mutual reinforcement learning process amongst all deterministic memory-one strategies.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/05/2021

Memory-two strategies forming symmetric mutual reinforcement learning equilibrium in repeated prisoner's dilemma game

We investigate symmetric equilibria of mutual reinforcement learning whe...
research
12/06/2022

Learning with Opponent Modeling in Repeated Auctions

We design an algorithm to learn bidding strategies in repeated auctions....
research
12/01/2020

Obtain Employee Turnover Rate and Optimal Reduction Strategy Based On Neural Network and Reinforcement Learning

Nowadays, human resource is an important part of various resources of en...
research
07/19/2022

Deep equilibrium networks are sensitive to initialization statistics

Deep equilibrium networks (DEQs) are a promising way to construct models...
research
06/16/2021

Reinforcement learning for pursuit and evasion of microswimmers at low Reynolds number

Aquatic organisms can use hydrodynamic cues to navigate, find their prey...
research
06/27/2019

QFlip: An Adaptive Reinforcement Learning Strategy for the FlipIt Security Game

A rise in Advanced Persistent Threats (APTs) has introduced a need for r...
research
01/28/2022

A deep Q-learning method for optimizing visual search strategies in backgrounds of dynamic noise

Humans process visual information with varying resolution (foveated visu...

Please sign up or login with your details

Forgot password? Click here to reset