Almost Surely √(T) Regret Bound for Adaptive LQR

01/13/2023
by Yiwen Lu, et al.

The Linear-Quadratic Regulation (LQR) problem with unknown system parameters has been widely studied, but it has remained unclear whether 𝒪̃(√(T)) regret, the best known dependence on time, can be achieved almost surely. In this paper, we propose an adaptive LQR controller with an almost sure 𝒪̃(√(T)) regret upper bound. The controller features a circuit-breaking mechanism, which circumvents potential safety breaches and guarantees convergence of the system parameter estimate, but is shown to be triggered only finitely often and hence has a negligible effect on the asymptotic performance of the controller. The proposed controller is also validated via simulation on the Tennessee Eastman Process (TEP), a commonly used industrial process example.
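To make the idea of a circuit-breaking mechanism concrete, the following is a minimal scalar sketch and not the algorithm from the paper. Everything specific in it is an illustrative assumption: the scalar system x' = a·x + b·u + w, the open-loop stable plant (|a| < 1) that makes zero input a safe fallback, the ridge-regularized least-squares estimator, the t^(-1/4) probing noise, the log-growing threshold, and the fixed cooldown length. The names `lqr_gain` and `simulate` are hypothetical.

```python
import math
import random


def lqr_gain(a, b, q=1.0, r=1.0, iters=100):
    """Iterate the scalar discrete-time Riccati recursion; return (k, p)."""
    p = q
    for _ in range(iters):
        # Equivalent to p = q + a^2 p - (a b p)^2 / (r + b^2 p),
        # written without a subtraction for numerical robustness.
        p = q + a * a * p * r / (r + b * b * p)
    k = a * b * p / (r + b * b * p)
    return k, p


def simulate(T=2000, a=0.8, b=1.0, sigma=1.0, cooldown_len=5, seed=0):
    """Certainty-equivalent adaptive LQR with an assumed circuit breaker."""
    rng = random.Random(seed)
    # Ridge-regularized normal equations for theta = (a, b), lambda = 1.
    s_xx, s_xu, s_uu = 1.0, 0.0, 1.0  # lambda*I + sum of z z^T, z = (x, u)
    s_xy, s_uy = 0.0, 0.0             # sum of z * x_next
    x, cost, trips, cooldown = 0.0, 0.0, 0, 0
    a_hat, b_hat = 0.0, 0.0
    for t in range(1, T + 1):
        # Solve the 2x2 system for the current parameter estimate.
        det = s_xx * s_uu - s_xu * s_xu
        a_hat = (s_uu * s_xy - s_xu * s_uy) / det
        b_hat = (s_xx * s_uy - s_xu * s_xy) / det
        k, _ = lqr_gain(a_hat, b_hat)
        u_ce = -k * x  # certainty-equivalent input

        if cooldown > 0:
            # Breaker is open: use the fallback input (zero, assumed safe).
            u_base = 0.0
            cooldown -= 1
        elif abs(u_ce) > math.log(t + 1):
            # Input is suspiciously large: trip the breaker.
            trips += 1
            cooldown = cooldown_len
            u_base = 0.0
        else:
            u_base = u_ce

        # Decaying probing noise keeps the estimator excited (assumption).
        u = u_base + t ** -0.25 * rng.gauss(0.0, 1.0)
        cost += x * x + u * u  # stage cost with q = r = 1
        x_next = a * x + b * u + sigma * rng.gauss(0.0, 1.0)

        # Update least-squares statistics with the new sample.
        s_xx += x * x
        s_xu += x * u
        s_uu += u * u
        s_xy += x * x_next
        s_uy += u * x_next
        x = x_next

    # Regret against the optimal average cost sigma^2 * p_star.
    _, p_star = lqr_gain(a, b)
    regret = cost - T * sigma * sigma * p_star
    return {"a_hat": a_hat, "b_hat": b_hat, "trips": trips, "regret": regret}


if __name__ == "__main__":
    print(simulate())
```

The breaker only overrides the certainty-equivalent input when that input exceeds a slowly growing threshold, so once the estimate is accurate the override should become rare. This mirrors, in a loose and informal way, the abstract's claim that the mechanism is triggered only finitely often.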


