Improper Learning for Non-Stochastic Control

01/25/2020
by   Max Simchowitz, et al.
0

We consider the problem of controlling a possibly unknown linear dynamical system with adversarial perturbations, adversarially chosen convex loss functions, and partially observed states, known as non-stochastic control. We introduce a controller parametrization based on the denoised observations, and prove that applying online gradient descent to this parametrization yields a new controller which attains sublinear regret vs. a large class of closed-loop policies. In the fully-adversarial setting, our controller attains an optimal regret bound of √(T)-when the system is known, and, when combined with an initial stage of least-squares estimation, T^2/3 when the system is unknown; both yield the first sublinear regret for the partially observed setting. Our bounds are the first in the non-stochastic control setting that compete with all stabilizing linear dynamical controllers, not just state feedback. Moreover, in the presence of semi-adversarial noise containing both stochastic and adversarial components, our controller attains the optimal regret bounds of poly(log T) when the system is known, and √(T) when unknown. To our knowledge, this gives the first end-to-end √(T) regret for online Linear Quadratic Gaussian controller, and applies in a more general setting with adversarial losses and semi-adversarial noise.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/12/2020

Non-Stochastic Control with Bandit Feedback

We study the problem of controlling a linear dynamical system with adver...
research
06/10/2020

Making Non-Stochastic Control (Almost) as Easy as Stochastic

Recent literature has made much progress in understanding online LQR: a ...
research
11/21/2022

Best of Both Worlds in Online Control: Competitive Ratio and Policy Regret

We consider the fundamental problem of online control of a linear dynami...
research
07/16/2023

Byzantine-Robust Distributed Online Learning: Taming Adversarial Participants in An Adversarial Environment

This paper studies distributed online learning under Byzantine attacks. ...
research
10/23/2022

LQGNet: Hybrid Model-Based and Data-Driven Linear Quadratic Stochastic Control

Stochastic control deals with finding an optimal control signal for a dy...
research
12/12/2020

Generating Adversarial Disturbances for Controller Verification

We consider the problem of generating maximally adversarial disturbances...
research
02/06/2020

No-Regret Prediction in Marginally Stable Systems

We consider the problem of online prediction in a marginally stable line...

Please sign up or login with your details

Forgot password? Click here to reset