Optimisation of Structured Neural Controller Based on Continuous-Time Policy Gradient

01/17/2022
by   Namhoon Cho, et al.
0

This study presents a policy optimisation framework for structured nonlinear control of continuous-time (deterministic) dynamic systems. The proposed approach prescribes a structure for the controller based on relevant scientific knowledge (such as Lyapunov stability theory or domain experiences) while considering the tunable elements inside the given structure as the point of parametrisation with neural networks. To optimise a cost represented as a function of the neural network weights, the proposed approach utilises the continuous-time policy gradient method based on adjoint sensitivity analysis as a means for correct and performant computation of cost gradient. This enables combining the stability, robustness, and physical interpretability of an analytically-derived structure for the feedback controller with the representational flexibility and optimised resulting performance provided by machine learning techniques. Such a hybrid paradigm for fixed-structure control synthesis is particularly useful for optimising adaptive nonlinear controllers to achieve improved performance in online operation, an area where the existing theory prevails the design of structure while lacking clear analytical understandings about tuning of the gains and the uncertainty model basis functions that govern the performance characteristics. Numerical experiments on aerospace applications illustrate the utility of the structured nonlinear controller optimisation framework.

READ FULL TEXT

page 15

page 16

research
12/12/2020

Faster Policy Learning with Continuous-Time Gradients

We study the estimation of policy gradients for continuous-time systems ...
research
02/11/2023

A Policy Gradient Framework for Stochastic Optimal Control Problems with Global Convergence Guarantee

In this work, we consider the stochastic optimal control problem in cont...
research
05/04/2020

Stability Analysis for Nonlinear Weakly Hard Real-Time Control Systems

This paper considers the stability analysis for nonlinear sampled-data s...
research
09/08/2022

Incremental Correction in Dynamic Systems Modelled with Neural Networks for Constraint Satisfaction

This study presents incremental correction methods for refining neural n...
research
09/08/2021

Recurrent Neural Network Controllers Synthesis with Stability Guarantees for Partially Observed Systems

Neural network controllers have become popular in control tasks thanks t...
research
06/20/2023

A Passivity-Based Method for Accelerated Convex Optimisation

This study presents a constructive methodology for designing accelerated...
research
02/09/2021

Orbital Stabilization of Point-to-Point Maneuvers in Underactuated Mechanical Systems

The task of inducing, via continuous static state-feedback, an asymptoti...

Please sign up or login with your details

Forgot password? Click here to reset