ED2: An Environment Dynamics Decomposition Framework for World Model Construction

12/06/2021
by   Cong Wang, et al.
0

Model-based reinforcement learning methods achieve significant sample efficiency in many tasks, but their performance is often limited by the existence of the model error. To reduce the model error, previous works use a single well-designed network to fit the entire environment dynamics, which treats the environment dynamics as a black box. However, these methods lack to consider the environmental decomposed property that the dynamics may contain multiple sub-dynamics, which can be modeled separately, allowing us to construct the world model more accurately. In this paper, we propose the Environment Dynamics Decomposition (ED2), a novel world model construction framework that models the environment in a decomposing manner. ED2 contains two key components: sub-dynamics discovery (SD2) and dynamics decomposition prediction (D2P). SD2 discovers the sub-dynamics in an environment and then D2P constructs the decomposed world model following the sub-dynamics. ED2 can be easily combined with existing MBRL algorithms and empirical results show that ED2 significantly reduces the model error and boosts the performance of the state-of-the-art MBRL algorithms on various tasks.

READ FULL TEXT

page 7

page 14

research
12/21/2019

Can Agents Learn by Analogy? An Inferable Model for PAC Reinforcement Learning

Model-based reinforcement learning algorithms make decisions by building...
research
06/09/2022

A Relational Intervention Approach for Unsupervised Dynamics Generalization in Model-Based Reinforcement Learning

The generalization of model-based reinforcement learning (MBRL) methods ...
research
06/11/2019

Learning Powerful Policies by Using Consistent Dynamics Model

Model-based Reinforcement Learning approaches have the promise of being ...
research
02/16/2021

Model-based Meta Reinforcement Learning using Graph Structured Surrogate Models

Reinforcement learning is a promising paradigm for solving sequential de...
research
03/04/2019

Sequential Relational Decomposition

The concept of decomposition in computer science and engineering is cons...
research
06/08/2021

LEADS: Learning Dynamical Systems that Generalize Across Environments

When modeling dynamical systems from real-world data samples, the distri...
research
07/07/2021

Harnessing Heterogeneity: Learning from Decomposed Feedback in Bayesian Modeling

There is significant interest in learning and optimizing a complex syste...

Please sign up or login with your details

Forgot password? Click here to reset