FORESEE: Model-based Reinforcement Learning using Unscented Transform with application to Tuning of Control Barrier Functions

09/26/2022
by   Hardik Parwana, et al.
0

In this paper, we introduce a novel online model-based reinforcement learning algorithm that uses Unscented Transform to propagate uncertainty for the prediction of the future reward. Previous approaches either approximate the state distribution at each step of the prediction horizon with a Gaussian, or perform Monte Carlo simulations to estimate the rewards. Our method, depending on the number of sigma points employed, can propagate either mean and covariance with minimal points, or higher-order moments with more points similarly to Monte Carlo. The whole framework is implemented as a computational graph for online training. Furthermore, in order to prevent explosion in the number of sigma points when propagating through a generic state-dependent uncertainty model, we add sigma-point expansion and contraction layers to our graph, which are designed using the principle of moment matching. Finally, we propose gradient descent inspired by Sequential Quadratic Programming to update policy parameters in the presence of state constraints. We demonstrate the proposed method with two applications in simulation. The first one designs a stabilizing controller for the cart-pole problem when the dynamics is known with state-dependent uncertainty. The second example, following up on our previous work, tunes the parameters of a control barrier function-based Quadratic Programming controller for a leader-follower problem in the presence of input constraints.

READ FULL TEXT
research
01/21/2021

Model-based Policy Search for Partially Measurable Systems

In this paper, we propose a Model-Based Reinforcement Learning (MBRL) al...
research
04/16/2021

Safe Exploration in Model-based Reinforcement Learning using Control Barrier Functions

This paper studies the problem of developing an approximate dynamic prog...
research
03/23/2023

Rate-Tunable Control Barrier Functions: Methods and Algorithms for Online Adaptation

Control Barrier Functions offer safety certificates by dictating control...
research
01/28/2021

Model-Based Policy Search Using Monte Carlo Gradient Estimation with Real Systems Application

In this paper, we present a Model-Based Reinforcement Learning algorithm...
research
06/16/2022

Barrier Certified Safety Learning Control: When Sum-of-Square Programming Meets Reinforcement Learning

Safety guarantee is essential in many engineering implementations. Reinf...
research
04/15/2020

Bootstrapped model learning and error correction for planning with uncertainty in model-based RL

Having access to a forward model enables the use of planning algorithms ...
research
05/04/2021

Data-Efficient Reinforcement Learning for Malaria Control

Sequential decision-making under cost-sensitive tasks is prohibitively d...

Please sign up or login with your details

Forgot password? Click here to reset