Beyond No-Regret: Competitive Control via Online Optimization with Memory

02/13/2020
by   Guanya Shi, et al.
15

This paper studies online control with adversarial disturbances using tools from online optimization with memory. Most work that bridges learning and control theory focuses on designing policies that are no-regret with respect to the best static linear controller in hindsight. However, the optimal offline controller can have orders-of-magnitude lower cost than the best linear controller. We instead focus on achieving constant competitive ratio compared to the offline optimal controller, which need not be linear or static. We provide a novel reduction from online control of a class of controllable systems to online convex optimization with memory. We then design a new algorithm for online convex optimization with memory, Optimistic Regularized Online Balanced Descent, that has a constant, dimension-free competitive ratio. This result, in turn, leads to a new constant-competitive approach for online control.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/28/2021

Competitive Control

We consider control from the perspective of competitive analysis. Unlike...
research
11/24/2020

Regret-optimal measurement-feedback control

We consider measurement-feedback control in linear dynamical systems fro...
research
11/21/2022

Best of Both Worlds in Online Control: Competitive Ratio and Policy Regret

We consider the fundamental problem of online control of a linear dynami...
research
03/21/2020

A new regret analysis for Adam-type algorithms

In this paper, we focus on a theory-practice gap for Adam and its varian...
research
10/29/2021

Online Optimization with Feedback Delay and Nonlinear Switching Cost

We study a variant of online optimization in which the learner receives ...
research
12/12/2020

Generating Adversarial Disturbances for Controller Verification

We consider the problem of generating maximally adversarial disturbances...
research
04/15/2020

Online Multiserver Convex Chasing and Optimization

We introduce the problem of k-chasing of convex functions, a simultaneou...

Please sign up or login with your details

Forgot password? Click here to reset