Learning Accurate Long-term Dynamics for Model-based Reinforcement Learning

12/16/2020
by Nathan O. Lambert, et al.

Accurately predicting the dynamics of robotic systems is crucial for model-based control and reinforcement learning. The most common way to estimate dynamics is to fit a one-step-ahead prediction model and use it to recursively propagate the predicted state distribution over long horizons. Unfortunately, this approach is known to compound even small prediction errors, making long-term predictions inaccurate. In this paper, we propose a new parametrization of supervised learning on state-action data that predicts stably at longer horizons, which we call a trajectory-based model. This trajectory-based model takes an initial state, a future time index, and control parameters as inputs, and predicts the state at the future time. Our results in simulated and experimental robotic tasks show that our trajectory-based models yield significantly more accurate long-term predictions, improved sample efficiency, and the ability to predict task reward.
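
The difference between the two parametrizations can be illustrated by how supervised training pairs might be built from one logged trajectory. The sketch below is a minimal illustration under stated assumptions, not the authors' implementation: the function names (`one_step_dataset`, `trajectory_dataset`, `recursive_rollout`), the input layout, the choice to index targets from t = 1, and the toy controller gains `theta` are all hypothetical.

```python
import numpy as np


def one_step_dataset(states, actions):
    """One-step pairs: input (s_t, a_t), target s_{t+1}.

    states: array of shape (T + 1, state_dim); actions: shape (T, action_dim).
    """
    inputs = np.concatenate([states[:-1], actions], axis=1)
    targets = states[1:]
    return inputs, targets


def trajectory_dataset(states, control_params):
    """Trajectory-based pairs: input (s_0, t, theta), target s_t for t = 1..T.

    control_params (theta) is a flat vector describing the controller
    (an assumed encoding, e.g. feedback gains).
    """
    horizon = states.shape[0] - 1
    t = np.arange(1, horizon + 1, dtype=float).reshape(-1, 1)
    s0 = np.repeat(states[:1], horizon, axis=0)
    theta = np.repeat(control_params.reshape(1, -1), horizon, axis=0)
    inputs = np.concatenate([s0, t, theta], axis=1)
    targets = states[1:]
    return inputs, targets


def recursive_rollout(step_fn, s0, policy, horizon):
    """Long-horizon prediction with a one-step model: each prediction is fed
    back in as the next input, so any error in step_fn is carried forward."""
    s = s0
    predicted = []
    for _ in range(horizon):
        s = step_fn(s, policy(s))
        predicted.append(s)
    return np.stack(predicted)


# Toy trajectory with random data, only to show the shapes involved.
rng = np.random.default_rng(0)
T, state_dim, action_dim = 5, 2, 1
states = rng.normal(size=(T + 1, state_dim))
actions = rng.normal(size=(T, action_dim))
theta = np.array([0.5, -0.1])  # hypothetical controller gains

X_one, Y_one = one_step_dataset(states, actions)
X_traj, Y_traj = trajectory_dataset(states, theta)
print(X_one.shape, Y_one.shape)    # (5, 3) (5, 2)
print(X_traj.shape, Y_traj.shape)  # (5, 5) (5, 2)
```

Under this layout, a trajectory-based regressor is queried once per future time index, whereas the one-step model must be applied `horizon` times in sequence through `recursive_rollout`.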


