Technical Report: Adaptive Control for Linearizable Systems Using On-Policy Reinforcement Learning

04/06/2020
by   Tyler Westenbroek, et al.
5

This paper proposes a framework for adaptively learning a feedback linearization-based tracking controller for an unknown system using discrete-time model-free policy-gradient parameter update rules. The primary advantage of the scheme over standard model-reference adaptive control techniques is that it does not require the learned inverse model to be invertible at all instances of time. This enables the use of general function approximators to approximate the linearizing controller for the system without having to worry about singularities. However, the discrete-time and stochastic nature of these algorithms precludes the direct application of standard machinery from the adaptive control literature to provide deterministic stability proofs for the system. Nevertheless, we leverage these techniques alongside tools from the stochastic approximation literature to demonstrate that with high probability the tracking and parameter errors concentrate near zero when a certain persistence of excitation condition is satisfied. A simulated example of a double pendulum demonstrates the utility of the proposed theory. 1

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/25/2023

Revisiting LQR Control from the Perspective of Receding-Horizon Policy Gradient

We revisit in this paper the discrete-time linear quadratic regulator (L...
research
04/02/2023

Stability Bounds for Learning-Based Adaptive Control of Discrete-Time Multi-Dimensional Stochastic Linear Systems with Input Constraints

We consider the problem of adaptive stabilization for discrete-time, mul...
research
01/31/2022

Steady-State Error Compensation in Reference Tracking and Disturbance Rejection Problems for Reinforcement Learning-Based Control

Reinforcement learning (RL) is a promising, upcoming topic in automatic ...
research
08/20/2020

Model-free optimal control of discrete-time systems with additive and multiplicative noises

This paper investigates the optimal control problem for a class of discr...
research
12/14/2021

Nonlinear Discrete-time Systems' Identification without Persistence of Excitation: A Finite-time Concurrent Learning

This paper deals with the problem of finite-time learning for unknown di...
research
06/15/2020

An online evolving framework for advancing reinforcement-learning based automated vehicle control

In this paper, an online evolving framework is proposed to detect and re...
research
03/16/2022

Input Influence Matrix Design for MIMO Discrete-Time Ultra-Local Model

Ultra-Local Models (ULM) have been applied to perform model-free control...

Please sign up or login with your details

Forgot password? Click here to reset