Data-Efficient Reinforcement Learning with Probabilistic Model Predictive Control

06/20/2017
by   Sanket Kamthe, et al.
0

Trial-and-error based reinforcement learning (RL) has seen rapid advancements in recent times, especially with the advent of deep neural networks. However, the majority of autonomous RL algorithms either rely on engineered features or a large number of interactions with the environment. Such a large number of interactions may be impractical in many real-world applications. For example, robots are subject to wear and tear and, hence, millions of interactions may change or damage the system. Moreover, practical systems have limitations in the form of the maximum torque that can be safely applied. To reduce the number of system interactions while naturally handling constraints, we propose a model-based RL framework based on Model Predictive Control (MPC). In particular, we propose to learn a probabilistic transition model using Gaussian Processes (GPs) to incorporate model uncertainties into long-term predictions, thereby, reducing the impact of model errors. We then use MPC to find a control sequence that minimises the expected long-term cost. We provide theoretical guarantees for the first-order optimality in the GP-based transition models with deterministic approximate inference for long-term planning. The proposed framework demonstrates superior data efficiency and learning rates compared to the current state of the art.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/10/2015

Gaussian Processes for Data-Efficient Learning in Robotics and Control

Autonomous learning has been a promising direction in control and roboti...
research
06/01/2023

IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive Control

Model-based reinforcement learning (RL) has shown great promise due to i...
research
11/11/2019

Driving Reinforcement Learning with Models

Over the years, Reinforcement Learning (RL) established itself as a conv...
research
04/20/2021

Model-predictive control and reinforcement learning in multi-energy system case studies

Model-predictive-control (MPC) offers an optimal control technique to es...
research
06/16/2023

Actor-Critic Model Predictive Control

Despite its success, Model Predictive Control (MPC) often requires inten...
research
10/21/2022

Deep Reinforcement Learning for Stabilization of Large-scale Probabilistic Boolean Networks

The ability to direct a Probabilistic Boolean Network (PBN) to a desired...
research
02/05/2020

Deep Learning Tubes for Tube MPC

Learning-based control aims to construct models of a system to use for p...

Please sign up or login with your details

Forgot password? Click here to reset