Implications of Regret on Stability of Linear Dynamical Systems

11/14/2022
by Aren Karapetyan, et al.

The setting of an agent making decisions under uncertainty and under dynamic constraints is common in optimal control, reinforcement learning and, more recently, online learning. In the online learning setting, the quality of an agent's decisions is often quantified by regret, which compares the performance of the chosen decisions to the best possible ones in hindsight. While regret is a useful performance measure, when dynamical systems are involved it is also important to assess the stability of the closed-loop system under the chosen policy. In this work, we show that for linear state feedback policies and linear systems subject to adversarial disturbances, linear regret implies asymptotic stability in both time-varying and time-invariant settings. Conversely, we also show that bounded input bounded state (BIBS) stability and summability of the state transition matrices imply linear regret.
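As a rough illustration of the link between stability and cost growth described above, the following minimal sketch simulates a scalar linear system under a linear state feedback policy and a bounded disturbance. It is not taken from the paper: the scalar dynamics, the quadratic stage cost, the gains, and the function name `cumulative_cost` are all assumptions chosen for illustration. It also does not compute the best-in-hindsight benchmark; it only uses the fact that, when the benchmark cost is nonnegative, regret is at most the policy's own cumulative cost.

```python
def cumulative_cost(a, b, k, disturbances, q=1.0, r=1.0, x0=0.0):
    """Simulate x_{t+1} = a*x_t + b*u_t + w_t with u_t = -k*x_t.

    Returns the running totals of the (assumed) quadratic stage cost
    q*x_t^2 + r*u_t^2, one entry per time step.
    """
    x, total, totals = x0, 0.0, []
    for w in disturbances:
        u = -k * x                      # linear state feedback
        total += q * x * x + r * u * u  # accumulate stage cost
        totals.append(total)
        x = a * x + b * u + w           # closed-loop update
    return totals


T = 200
w = [1.0] * T  # bounded disturbance sequence, |w_t| <= 1 (illustrative choice)

# Stabilizing gain: closed-loop factor a - b*k = 0.5, so the state stays bounded.
stable = cumulative_cost(a=1.2, b=1.0, k=0.7, disturbances=w)

# Non-stabilizing gain: closed-loop factor a - b*k = 1.1, so the state diverges.
unstable = cumulative_cost(a=1.2, b=1.0, k=0.1, disturbances=w)

print("average cost per step, stabilizing gain:    ", stable[-1] / T)
print("average cost per step, non-stabilizing gain:", unstable[-1] / T)
```

With the stabilizing gain the per-step cost stays bounded, so the cumulative cost (and hence regret against any nonnegative-cost benchmark) grows at most linearly in the horizon. With the non-stabilizing gain the cumulative cost grows geometrically, which is the kind of behaviour a linear regret bound rules out.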


research
01/02/2023

Efficient Online Learning with Memory via Frank-Wolfe Optimization: Algorithms with Bounded Dynamic Regret and Applications to Control

Projection operations are a typical computation bottleneck in online lea...
research
06/15/2020

Learning Expected Reward for Switched Linear Control Systems: A Non-Asymptotic View

In this work, we show existence of invariant ergodic measure for switche...
research
02/16/2022

Online Control of Unknown Time-Varying Dynamical Systems

We study online control of time-varying linear systems with unknown dyna...
research
12/12/2020

Generating Adversarial Disturbances for Controller Verification

We consider the problem of generating maximally adversarial disturbances...
research
06/06/2022

Learning to Control under Time-Varying Environment

This paper investigates the problem of regret minimization in linear tim...
research
06/09/2022

Regret Analysis of Certainty Equivalence Policies in Continuous-Time Linear-Quadratic Systems

This work studies theoretical performance guarantees of a ubiquitous rei...
research
04/29/2021

Stable Online Control of Linear Time-Varying Systems

Linear time-varying (LTV) systems are widely used for modeling real-worl...
