Transfer learning with causal counterfactual reasoning in Decision Transformers

10/27/2021
by   Ayman Boustati, et al.
0

The ability to adapt to changes in environmental contingencies is an important challenge in reinforcement learning. Indeed, transferring previously acquired knowledge to environments with unseen structural properties can greatly enhance the flexibility and efficiency by which novel optimal policies may be constructed. In this work, we study the problem of transfer learning under changes in the environment dynamics. In this study, we apply causal reasoning in the offline reinforcement learning setting to transfer a learned policy to new environments. Specifically, we use the Decision Transformer (DT) architecture to distill a new policy on the new environment. The DT is trained on data collected by performing policy rollouts on factual and counterfactual simulations from the source environment. We show that this mechanism can bootstrap a successful policy on the target environment while retaining most of the reward.

READ FULL TEXT

page 3

page 5

research
07/18/2020

Structure Mapping for Transferability of Causal Models

Human beings learn causal models and constantly use them to transfer kno...
research
09/28/2019

MULTIPOLAR: Multi-Source Policy Aggregation for Transfer Reinforcement Learning between Diverse Environmental Dynamics

Transfer reinforcement learning (RL) aims at improving learning efficien...
research
06/20/2020

Counterfactually Guided Policy Transfer in Clinical Settings

Reliably transferring treatment policies learned in one clinical environ...
research
07/06/2021

AdaRL: What, Where, and How to Adapt in Transfer Reinforcement Learning

Most approaches in reinforcement learning (RL) are data-hungry and speci...
research
04/18/2023

Think Before You Act: Unified Policy for Interleaving Language Reasoning with Actions

The success of transformer models trained with a language modeling objec...
research
11/26/2022

Transfer RL via the Undo Maps Formalism

Transferring knowledge across domains is one of the most fundamental pro...
research
09/19/2021

Dual Behavior Regularized Reinforcement Learning

Reinforcement learning has been shown to perform a range of complex task...

Please sign up or login with your details

Forgot password? Click here to reset