Tackling Non-Stationarity in Reinforcement Learning via Causal-Origin Representation

06/05/2023
by   Wanpeng Zhang, et al.
0

In real-world scenarios, the application of reinforcement learning is significantly challenged by complex non-stationarity. Most existing methods attempt to model the changes of the environment explicitly, often requiring impractical prior knowledge. In this paper, we propose a new perspective, positing that non-stationarity can propagate and accumulate through complex causal relationships during state transitions, thereby compounding its sophistication and affecting policy learning. We believe that this challenge can be more effectively addressed by tracing the causal origin of non-stationarity. To this end, we introduce the Causal-Origin REPresentation (COREP) algorithm. COREP primarily employs a guided updating mechanism to learn a stable graph representation for states termed as causal-origin representation. By leveraging this representation, the learned policy exhibits impressive resilience to non-stationarity. We supplement our approach with a theoretical analysis grounded in the causal interpretation for non-stationary reinforcement learning, advocating for the validity of the causal-origin representation. Experimental results further demonstrate the superior performance of COREP over existing methods in tackling non-stationarity.

READ FULL TEXT

page 4

page 16

research
08/20/2019

Reinforcement Learning is not a Causal problem

We use an analogy between non-isomorphic mathematical structures defined...
research
10/29/2020

Causal variables from reinforcement learning using generalized Bellman equations

Many open problems in machine learning are intrinsically related to caus...
research
03/30/2022

Factored Adaptation for Non-Stationary Reinforcement Learning

Dealing with non-stationarity in environments (i.e., transition dynamics...
research
05/24/2019

Continual Reinforcement Learning in 3D Non-stationary Environments

High-dimensional always-changing environments constitute a hard challeng...
research
10/28/2022

Using Contrastive Samples for Identifying and Leveraging Possible Causal Relationships in Reinforcement Learning

A significant challenge in reinforcement learning is quantifying the com...
research
02/13/2019

Stable multi-instance learning visa causal inference

Multi-instance learning (MIL) deals with tasks where each example is rep...
research
10/07/2020

Causal Curiosity: RL Agents Discovering Self-supervised Experiments for Causal Representation Learning

Humans show an innate ability to learn the regularities of the world thr...

Please sign up or login with your details

Forgot password? Click here to reset