Deterministic limit of temporal difference reinforcement learning for stochastic games

09/19/2018
by   Wolfram Barfuss, et al.
0

Reinforcement learning in multi-agent systems has been studied in the fields of economic game theory, artificial intelligence and statistical physics by developing an analytical understanding of the learning dynamics (often in relation to the replicator dynamics of evolutionary game theory). However, the majority of these analytical studies focuses on repeated normal form games, which only have a single environmental state. Environmental dynamics, i.e. changes in the state of an environment affecting the agents' payoffs has received less attention, lacking a universal method to obtain deterministic equations from established multi-state reinforcement learning algorithms. In this work we present a novel methodology to derive the deterministic limit resulting from an interaction-adaptation time scales separation of a general class of reinforcement learning algorithms, called temporal difference learning. This form of learning is equipped to function in more realistic multi-state environments by using the estimated value of future environmental states to adapt the agent's behavior. We demonstrate the potential of our method with the three well established learning algorithms Q learning, SARSA learning and Actor-Critic learning. Illustrations of their dynamics on two multi-agent, multi-state environments reveal a wide range of different dynamical regimes, such as convergence to fixed points, limit cycles and even deterministic chaos.

READ FULL TEXT
research
10/01/2017

Parameter Sharing Deep Deterministic Policy Gradient for Cooperative Multi-agent Reinforcement Learning

Deep reinforcement learning for multi-agent cooperation and competition ...
research
09/15/2021

Evolutionary Reinforcement Learning Dynamics with Irreducible Environmental Uncertainty

In this work we derive and present evolutionary reinforcement learning d...
research
11/23/2021

Independent Learning in Stochastic Games

Reinforcement learning (RL) has recently achieved tremendous successes i...
research
01/29/2021

Poincaré-Bendixson Limit Sets in Multi-Agent Learning

A key challenge of evolutionary game theory and multi-agent learning is ...
research
10/06/2020

Heterogeneous Multi-Agent Reinforcement Learning for Unknown Environment Mapping

Reinforcement learning in heterogeneous multi-agent scenarios is importa...
research
11/26/2019

The problem with DDPG: understanding failures in deterministic environments with sparse rewards

In environments with continuous state and action spaces, state-of-the-ar...
research
02/11/2021

Echo State Networks for Reinforcement Learning

Echo State Networks (ESNs) are a type of single-layer recurrent neural n...

Please sign up or login with your details

Forgot password? Click here to reset