Causal Dynamics Learning for Task-Independent State Abstraction

06/27/2022
by   Zizhao Wang, et al.
14

Learning dynamics models accurately is an important goal for Model-Based Reinforcement Learning (MBRL), but most MBRL methods learn a dense dynamics model which is vulnerable to spurious correlations and therefore generalizes poorly to unseen states. In this paper, we introduce Causal Dynamics Learning for Task-Independent State Abstraction (CDL), which first learns a theoretically proved causal dynamics model that removes unnecessary dependencies between state variables and the action, thus generalizing well to unseen states. A state abstraction can then be derived from the learned dynamics, which not only improves sample efficiency but also applies to a wider range of tasks than existing state abstraction methods. Evaluated on two simulated environments and downstream tasks, both the dynamics model and policies learned by the proposed method generalize well to unseen states and the derived state abstraction improves sample efficiency compared to learning without it.

READ FULL TEXT

page 17

page 18

page 19

page 20

page 21

page 22

page 27

page 28

research
02/19/2021

Model-Invariant State Abstractions for Model-Based Reinforcement Learning

Accuracy and generalization of dynamics models is key to the success of ...
research
06/03/2022

Offline Reinforcement Learning with Causal Structured World Models

Model-based methods have recently shown promising for offline reinforcem...
research
10/18/2021

MDP Abstraction with Successor Features

Abstraction plays an important role for generalisation of knowledge and ...
research
04/16/2019

Object-Oriented Dynamics Learning through Multi-Level Abstraction

Object-based approaches for learning action-conditioned dynamics has dem...
research
10/20/2022

MoCoDA: Model-based Counterfactual Data Augmentation

The number of states in a dynamic process is exponential in the number o...
research
09/14/2022

A Simple Approach for State-Action Abstraction using a Learned MDP Homomorphism

Animals are able to rapidly infer from limited experience when sets of s...
research
05/07/2023

Quantifying Consistency and Information Loss for Causal Abstraction Learning

Structural causal models provide a formalism to express causal relations...

Please sign up or login with your details

Forgot password? Click here to reset