Model-Based Reinforcement Learning for Stochastic Hybrid Systems

11/11/2021
by Hany Abdulsamad, et al.

Optimal control of general nonlinear systems is a central challenge in automation. Data-driven approaches to control, enabled by powerful function approximators, have recently had great success in tackling challenging robotic applications. However, such methods often obscure the structure of dynamics and control behind black-box over-parameterized representations, thus limiting our ability to understand the closed-loop behavior. This paper adopts a hybrid-system view of nonlinear modeling and control that lends an explicit hierarchical structure to the problem and breaks down complex dynamics into simpler localized units. To this end, we consider a sequence modeling paradigm that captures the temporal structure of the data and derive an expectation-maximization (EM) algorithm that automatically decomposes nonlinear dynamics into stochastic piecewise affine dynamical systems with nonlinear boundaries. Furthermore, we show that these time-series models naturally admit a closed-loop extension that we use to extract locally linear or polynomial feedback controllers from nonlinear experts via imitation learning. Finally, we introduce a novel hybrid relative entropy policy search (Hb-REPS) technique that incorporates the hierarchical nature of hybrid systems and optimizes a set of time-invariant local feedback controllers derived from a locally polynomial approximation of a global value function.
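
To make the phrase "stochastic piecewise affine dynamical systems with nonlinear boundaries" concrete, the following is a minimal illustrative sketch, not the paper's model or its EM algorithm. It simulates a scalar system with two discrete modes, each with its own affine-Gaussian dynamics, where the mode is sampled from a logistic function of a quadratic feature of the state. All names (`MODES`, `mode_probability`, `step`, `rollout`) and all numerical values are assumptions chosen for illustration.

```python
import math
import random

# Hypothetical two-mode scalar system; every number here is an illustrative assumption.
# Each mode z has affine dynamics x' = a*x + b*u + c + Gaussian noise with std sigma.
MODES = {
    0: {"a": 0.9, "b": 0.5, "c": 0.0, "sigma": 0.05},
    1: {"a": 0.5, "b": 0.1, "c": 1.0, "sigma": 0.05},
}


def mode_probability(x):
    """Probability of mode 1 given state x.

    The decision boundary x**2 = 1 is nonlinear in x, and the logistic
    function makes the switching stochastic rather than a hard threshold.
    """
    return 1.0 / (1.0 + math.exp(-4.0 * (x * x - 1.0)))


def step(x, u, rng):
    """Sample a discrete mode from the state, then apply that mode's dynamics."""
    z = 1 if rng.random() < mode_probability(x) else 0
    p = MODES[z]
    x_next = p["a"] * x + p["b"] * u + p["c"] + rng.gauss(0.0, p["sigma"])
    return z, x_next


def rollout(x0, controls, seed=0):
    """Simulate one trajectory; returns the mode sequence and the state sequence."""
    rng = random.Random(seed)
    x, modes, states = x0, [], [x0]
    for u in controls:
        z, x = step(x, u, rng)
        modes.append(z)
        states.append(x)
    return modes, states


if __name__ == "__main__":
    modes, states = rollout(x0=2.0, controls=[0.0] * 10, seed=1)
    print(modes)
    print([round(x, 3) for x in states])
```

In this sketch the parameters are fixed by hand. The learning problem described in the abstract runs in the opposite direction: given only observed state and control trajectories, infer the local affine models and the switching boundaries.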


research
05/04/2020

Hierarchical Decomposition of Nonlinear Dynamics and Control for System Identification and Policy Distillation

The control of nonlinear dynamical systems remains a major challenge for...
research
04/17/2019

Decoupled Data Based Approach for Learning to Control Nonlinear Dynamical Systems

This paper addresses the problem of learning the optimal control policy ...
research
03/07/2019

RLOC: Neurobiologically Inspired Hierarchical Reinforcement Learning Algorithm for Continuous Control of Nonlinear Dynamical Systems

Nonlinear optimal control problems are often solved with numerical metho...
research
07/11/2018

A Hierarchical Bayesian Linear Regression Model with Local Features for Stochastic Dynamics Approximation

One of the challenges in model-based control of stochastic dynamical sys...
research
04/25/2023

Suboptimal Controller Synthesis for Cart-Poles and Quadrotors via Sums-of-Squares

Sums-of-squares (SOS) optimization is a promising tool to synthesize cer...
research
04/06/2021

Adaptive Variants of Optimal Feedback Policies

We combine adaptive control directly with optimal or near-optimal value ...
research
09/23/2022

Reactive Anticipatory Robot Skills with Memory

Optimal control in robotics has been increasingly popular in recent year...
