Learning Over All Contracting and Lipschitz Closed-Loops for Partially-Observed Nonlinear Systems

04/12/2023
by Nicholas H. Barbara, et al.

This paper presents a policy parameterization for learning-based control on nonlinear, partially-observed dynamical systems. The parameterization is based on a nonlinear version of the Youla parameterization and the recently proposed Recurrent Equilibrium Network (REN) class of models. We prove that the resulting Youla-REN parameterization automatically satisfies stability (contraction) and user-tunable robustness (Lipschitz) conditions on the closed-loop system. This means it can be used for safe learning-based control with no additional constraints or projections required to enforce stability or robustness. We test the new policy class in simulation on two reinforcement learning tasks: 1) magnetic suspension, and 2) inverting a rotary-arm pendulum. We find that the Youla-REN performs similarly to existing learning-based and optimal control methods while also ensuring stability and exhibiting improved robustness to adversarial disturbances.
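For readers unfamiliar with the two closed-loop properties named above, the block below states the standard textbook forms of contraction and a Lipschitz bound. The notation is an assumption made here for illustration and is not taken from the paper: $x_t$ is the closed-loop state, $d$ a disturbance sequence, $z$ a performance output sequence, and $\alpha$, $\beta$, $\gamma$ are constants. The paper may use different but related definitions.

```latex
% Contraction (assumed notation): any two closed-loop trajectories x^a, x^b
% driven by the same inputs converge to each other exponentially.
\[
  \lVert x^a_t - x^b_t \rVert \le \beta \, \alpha^{t} \, \lVert x^a_0 - x^b_0 \rVert,
  \qquad \alpha \in [0, 1), \; \beta > 0, \; t \ge 0 .
\]

% Lipschitz bound (assumed notation): from the same initial state, the change
% in the output sequence is at most gamma times the change in the disturbance.
\[
  \lVert z^a - z^b \rVert_T \le \gamma \, \lVert d^a - d^b \rVert_T
  \quad \text{for all } T \ge 0,
  \qquad
  \lVert y \rVert_T := \Bigl( \textstyle\sum_{t=0}^{T} \lVert y_t \rVert^2 \Bigr)^{1/2} .
\]
```

In this reading, a smaller $\gamma$ corresponds to a closed loop that is less sensitive to disturbances, which is one way to interpret the "user-tunable robustness" mentioned in the abstract.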

research
12/08/2021

Learning over All Stabilizing Nonlinear Controllers for a Partially-Observed Linear System

We propose a parameterization of nonlinear output feedback controllers f...
research
02/08/2022

Bingham Policy Parameterization for 3D Rotations in Reinforcement Learning

We propose a new policy parameterization for representing 3D rotations d...
research
06/22/2023

RobustNeuralNetworks.jl: a Package for Machine Learning and Data-Driven Control with Certified Robustness

Neural networks are typically sensitive to small input perturbations, le...
research
12/02/2021

Youla-REN: Learning Nonlinear Feedback Policies with Robust Stability Guarantees

This paper presents a parameterization of nonlinear controllers for unce...
research
12/01/2022

Learning Robust State Observers using Neural ODEs (longer version)

Relying on recent research results on Neural ODEs, this paper presents a...
research
12/20/2021

Adversarially Robust Stability Certificates can be Sample-Efficient

Motivated by bridging the simulation to reality gap in the context of sa...
research
04/07/2020

Learning Control Barrier Functions from Expert Demonstrations

Inspired by the success of imitation and inverse reinforcement learning ...
