Finite Time Analysis of Optimal Adaptive Policies for Linear-Quadratic Systems

We consider the classical problem of controlling linear systems with quadratic cost. When the true system dynamics are unknown, an adaptive policy must learn the model parameters and plan a control policy simultaneously. Addressing this trade-off between accurate estimation and good control is the main challenge in adaptive control. Another important issue is to prevent the system from being destabilized by the lack of knowledge of its dynamics. Asymptotically optimal approaches have been extensively studied in the literature, but the few existing non-asymptotic results do not provide a comprehensive treatment of the problem. In this work, we establish finite time, high probability regret bounds that are optimal up to logarithmic factors. We also provide high probability guarantees for a stabilization algorithm based on random linear feedbacks. The results are obtained under very mild assumptions, requiring only: (i) stabilizability of the matrices encoding the system's dynamics, and (ii) a limited degree of heaviness of the noise distribution. To derive our results, we also introduce a number of new concepts and technical tools.
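
For concreteness, the setting described above is commonly written as follows. The notation here ($A_0$, $B_0$, $Q$, $R$, $J^\star$, $\mathcal{R}(T)$) is assumed for illustration and does not appear in the abstract; it is the standard textbook formulation of the linear-quadratic problem, not necessarily the paper's exact definitions.

```latex
\begin{align*}
x(t+1) &= A_0\, x(t) + B_0\, u(t) + w(t+1), \\
c_t &= x(t)^\top Q\, x(t) + u(t)^\top R\, u(t), \\
\mathcal{R}(T) &= \sum_{t=1}^{T} \bigl( c_t - J^\star \bigr).
\end{align*}
```

Here $x(t)$ is the state, $u(t)$ the control input, $w(t)$ the noise, $A_0$ and $B_0$ the unknown dynamics matrices, and $Q$, $R$ positive definite cost matrices. $J^\star$ denotes the optimal long-run average cost attainable when $A_0$ and $B_0$ are known, so the regret $\mathcal{R}(T)$ measures the cumulative excess cost an adaptive policy pays for having to learn the dynamics.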
