Online Constrained Model-based Reinforcement Learning

04/07/2020
by   Benjamin van Niekerk, et al.
6

Applying reinforcement learning to robotic systems poses a number of challenging problems. A key requirement is the ability to handle continuous state and action spaces while remaining within a limited time and resource budget. Additionally, for safe operation, the system must make robust decisions under hard constraints. To address these challenges, we propose a model based approach that combines Gaussian Process regression and Receding Horizon Control. Using sparse spectrum Gaussian Processes, we extend previous work by updating the dynamics model incrementally from a stream of sensory data. This results in an agent that can learn and plan in real-time under non-linear constraints. We test our approach on a cart pole swing-up environment and demonstrate the benefits of online learning on an autonomous racing task. The environment's dynamics are learned from limited training data and can be reused in new task instances without retraining.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/08/2021

Learning Unstable Dynamics with One Minute of Data: A Differentiation-based Gaussian Process Approach

We present a straightforward and efficient way to estimate dynamics mode...
research
09/05/2023

Distributionally Robust Model-based Reinforcement Learning with Large State Spaces

Three major challenges in reinforcement learning are the complex dynamic...
research
11/02/2020

Sample-efficient reinforcement learning using deep Gaussian processes

Reinforcement learning provides a framework for learning to control whic...
research
06/20/2022

Guided Safe Shooting: model based reinforcement learning with safety constraints

In the last decade, reinforcement learning successfully solved complex c...
research
03/27/2023

Model-Based Reinforcement Learning with Isolated Imaginations

World models learn the consequences of actions in vision-based interacti...
research
12/30/2021

Learning Agent State Online with Recurrent Generate-and-Test

Learning continually and online from a continuous stream of data is chal...
research
10/23/2020

TAMPC: A Controller for Escaping Traps in Novel Environments

We propose an approach to online model adaptation and control in the cha...

Please sign up or login with your details

Forgot password? Click here to reset