Evolutionary Reinforcement Learning Dynamics with Irreducible Environmental Uncertainty

09/15/2021
by Wolfram Barfuss, et al.

In this work we derive and present evolutionary reinforcement learning dynamics in which the agents are irreducibly uncertain about the current state of the environment. We evaluate the dynamics across different classes of partially observable agent-environment systems and find that irreducible environmental uncertainty can lead to better learning outcomes more quickly, stabilize the learning process, and overcome social dilemmas. However, as expected, we also find that partial observability may cause worse learning outcomes, for example, in the form of a catastrophic limit cycle. Compared to fully observant agents, learning with irreducible environmental uncertainty often requires more exploration and less weight on future rewards to obtain the best learning outcomes. Furthermore, we find a range of dynamical effects induced by partial observability, e.g., a critical slowing down of the learning processes between reward regimes and the separation of the learning dynamics into fast and slow directions. The presented dynamics are a practical tool for researchers in biology, social science, and machine learning to systematically investigate the evolutionary effects of environmental uncertainty.

research
09/19/2018

Deterministic limit of temporal difference reinforcement learning for stochastic games

Reinforcement learning in multi-agent systems has been studied in the fi...
research
07/22/2023

Emergence of Adaptive Circadian Rhythms in Deep Reinforcement Learning

Adapting to regularities of the environment is critical for biological o...
research
11/24/2020

Solving The Lunar Lander Problem under Uncertainty using Reinforcement Learning

Reinforcement Learning (RL) is an area of machine learning concerned wit...
research
09/01/2022

Intrinsic fluctuations of reinforcement learning promote cooperation

In this work, we ask for and answer what makes classical reinforcement l...
research
12/03/2019

SafeLife 1.0: Exploring Side Effects in Complex Environments

We present SafeLife, a publicly available reinforcement learning environ...
research
05/05/2023

Biophysical Cybernetics of Directed Evolution and Eco-evolutionary Dynamics

Many major questions in the theory of evolutionary dynamics can in a mea...
research
06/18/2021

Meta-control of social learning strategies

Social learning, copying other's behavior without actual experience, off...
