Logarithmic Regret for Adversarial Online Control

02/29/2020
by   Dylan J. Foster, et al.
0

We introduce a new algorithm for online linear-quadratic control in a known system subject to adversarial disturbances. Existing regret bounds for this setting scale as √(T) unless strong stochastic assumptions are imposed on the disturbance process. We give the first algorithm with logarithmic regret for arbitrary adversarial disturbance sequences, provided the state and control costs are given by known quadratic functions. Our algorithm and analysis use a characterization for the optimal offline control law to reduce the online control problem to (delayed) online learning with approximate advantage functions. Compared to previous techniques, our approach does not need to control movement costs for the iterates, leading to logarithmic regret.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/11/2019

Logarithmic Regret for Online Control

We study optimal regret bounds for control in linear dynamical systems u...
research
10/05/2018

Online Learning to Rank with Features

We introduce a new model for online ranking in which the click probabili...
research
06/10/2020

Making Non-Stochastic Control (Almost) as Easy as Stochastic

Recent literature has made much progress in understanding online LQR: a ...
research
09/28/2021

Near-Linear Time Algorithm with Near-Logarithmic Regret Per Switch for Mixable/Exp-Concave Losses

We investigate the problem of online learning, which has gained signific...
research
07/21/2019

Alice's Adventures in the Markovian World

This paper proposes an algorithm Alice having no access to the physics l...
research
02/23/2019

Online Control with Adversarial Disturbances

We study the control of a linear dynamical system with adversarial distu...
research
05/24/2023

Optimal Rates for Bandit Nonstochastic Control

Linear Quadratic Regulator (LQR) and Linear Quadratic Gaussian (LQG) con...

Please sign up or login with your details

Forgot password? Click here to reset