The Power of Linear Controllers in LQR Control

02/07/2020
by   Gautam Goel, et al.
0

The Linear Quadratic Regulator (LQR) framework considers the problem of regulating a linear dynamical system perturbed by environmental noise. We compute the policy regret between three distinct control policies: i) the optimal online policy, whose linear structure is given by the Ricatti equations; ii) the optimal offline linear policy, which is the best linear state feedback policy given the noise sequence; and iii) the optimal offline policy, which selects the globally optimal control actions given the noise sequence. We fully characterize the optimal offline policy and show that it has a recursive form in terms of the optimal online policy and future disturbances. We also show that cost of the optimal offline linear policy converges to the cost of the optimal online policy as the time horizon grows large, and consequently the optimal offline linear policy incurs linear regret relative to the optimal offline policy, even in the optimistic setting where the noise is drawn i.i.d from a known distribution. Although we focus on the setting where the noise is stochastic, our results also imply new lower bounds on the policy regret achievable when the noise is chosen by an adaptive adversary.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/24/2020

Regret-optimal measurement-feedback control

We consider measurement-feedback control in linear dynamical systems fro...
research
11/17/2022

Introduction to Online Nonstochastic Control

This text presents an introduction to an emerging paradigm in control of...
research
10/10/2020

Online Optimal Control with Affine Constraints

This paper considers online optimal control with affine constraints on t...
research
01/31/2021

Fast Rates for the Regret of Offline Reinforcement Learning

We study the regret of reinforcement learning from offline data generate...
research
01/02/2020

Adversarial Policies in Learning Systems with Malicious Experts

We consider a learning system based on the conventional multiplicative w...
research
06/10/2020

Making Non-Stochastic Control (Almost) as Easy as Stochastic

Recent literature has made much progress in understanding online LQR: a ...
research
10/20/2017

Uniformly bounded regret in the multi-secretary problem

In the secretary problem of Cayley (1875) and Moser (1956), n non-negati...

Please sign up or login with your details

Forgot password? Click here to reset