Learning to Control under Time-Varying Environment

06/06/2022
by   Yuzhen Han, et al.
0

This paper investigates the problem of regret minimization in linear time-varying (LTV) dynamical systems. Due to the simultaneous presence of uncertainty and non-stationarity, designing online control algorithms for unknown LTV systems remains a challenging task. At a cost of NP-hard offline planning, prior works have introduced online convex optimization algorithms, although they suffer from nonparametric rate of regret. In this paper, we propose the first computationally tractable online algorithm with regret guarantees that avoids offline planning over the state linear feedback policies. Our algorithm is based on the optimism in the face of uncertainty (OFU) principle in which we optimistically select the best model in a high confidence region. Our algorithm is then more explorative when compared to previous approaches. To overcome non-stationarity, we propose either a restarting strategy (R-OFU) or a sliding window (SW-OFU) strategy. With proper configuration, our algorithm is attains sublinear regret O(T^2/3). These algorithms utilize data from the current phase for tracking variations on the system dynamics. We corroborate our theoretical findings with numerical experiments, which highlight the effectiveness of our methods. To the best of our knowledge, our study establishes the first model-based online algorithm with regret guarantees under LTV dynamical systems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/08/2020

Adaptive Regret for Control of Time-Varying Dynamics

We consider regret minimization for online control with time-varying lin...
research
02/16/2022

Online Control of Unknown Time-Varying Dynamical Systems

We study online control of time-varying linear systems with unknown dyna...
research
01/02/2023

Efficient Online Learning with Memory via Frank-Wolfe Optimization: Algorithms with Bounded Dynamic Regret and Applications to Control

Projection operations are a typical computation bottleneck in online lea...
research
07/13/2020

Black-Box Control for Linear Dynamical Systems

We consider the problem of controlling an unknown linear time-invariant ...
research
02/26/2021

A Regret Minimization Approach to Iterative Learning Control

We consider the setting of iterative learning control, or model-based po...
research
08/03/2018

Structured Neural Network Dynamics for Model-based Control

We present a structured neural network architecture that is inspired by ...
research
11/14/2022

Implications of Regret on Stability of Linear Dynamical Systems

The setting of an agent making decisions under uncertainty and under dyn...

Please sign up or login with your details

Forgot password? Click here to reset