Synthesizing Stable Reduced-Order Visuomotor Policies for Nonlinear Systems via Sums-of-Squares Optimization

04/24/2023
by   Glen Chou, et al.
0

We present a method for synthesizing dynamic, reduced-order output-feedback polynomial control policies for control-affine nonlinear systems which guarantees runtime stability to a goal state, when using visual observations and a learned perception module in the feedback control loop. We leverage Lyapunov analysis to formulate the problem of synthesizing such policies. This problem is nonconvex in the policy parameters and the Lyapunov function that is used to prove the stability of the policy. To solve this problem approximately, we propose two approaches: the first solves a sequence of sum-of-squares optimization problems to iteratively improve a policy which is provably-stable by construction, while the second directly performs gradient-based optimization on the parameters of the polynomial policy, and its closed-loop stability is verified a posteriori. We extend our approach to provide stability guarantees in the presence of observation noise, which realistically arises due to errors in the learned perception module. We evaluate our approach on several underactuated nonlinear systems, including pendula and quadrotors, showing that our guarantees translate to empirical stability when controlling these systems from images, while baseline approaches can fail to reliably stabilize the system.

READ FULL TEXT

page 1

page 5

page 6

page 7

research
03/02/2022

Learning Stochastic Parametric Differentiable Predictive Control Policies

The problem of synthesizing stochastic explicit model predictive control...
research
01/03/2022

Revisiting PGD Attacks for Stability Analysis of Large-Scale Nonlinear Systems and Perception-Based Control

Many existing region-of-attraction (ROA) analysis tools find difficulty ...
research
12/02/2021

Youla-REN: Learning Nonlinear Feedback Policies with Robust Stability Guarantees

This paper presents a parameterization of nonlinear controllers for unce...
research
03/27/2021

On the Stability of Nonlinear Receding Horizon Control: A Geometric Perspective

The widespread adoption of nonlinear Receding Horizon Control (RHC) stra...
research
02/18/2021

Closing the Closed-Loop Distribution Shift in Safe Imitation Learning

Commonly used optimization-based control strategies such as model-predic...
research
05/16/2023

The Power of Learned Locally Linear Models for Nonlinear Policy Optimization

A common pipeline in learning-based control is to iteratively estimate a...
research
03/22/2022

Neural System Level Synthesis: Learning over All Stabilizing Policies for Nonlinear Systems

We address the problem of designing stabilizing control policies for non...

Please sign up or login with your details

Forgot password? Click here to reset