Understand Dynamic Regret with Switching Cost for Online Decision Making

11/28/2019
by   Yawei Zhao, et al.
0

As a metric to measure the performance of an online method, dynamic regret with switching cost has drawn much attention for online decision making problems. Although the sublinear regret has been provided in many previous researches, we still have little knowledge about the relation between the dynamic regret and the switching cost. In the paper, we investigate the relation for two classic online settings: Online Algorithms (OA) and Online Convex Optimization (OCO). We provide a new theoretical analysis framework, which shows an interesting observation, that is, the relation between the switching cost and the dynamic regret is different for settings of OA and OCO. Specifically, the switching cost has significant impact on the dynamic regret in the setting of OA. But, it does not have an impact on the dynamic regret in the setting of OCO. Furthermore, we provide a lower bound of regret for the setting of OCO, which is same with the lower bound in the case of no switching cost. It shows that the switching cost does not change the difficulty of online decision making problems in the setting of OCO.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/21/2021

Online Convex Optimization with Continuous Switching Constraint

In many sequential decision making applications, the change of decision ...
research
02/13/2021

Revisiting Smoothed Online Learning

In this paper, we revisit the problem of smoothed online learning, in wh...
research
09/24/2022

Improving the Bounds of the Online Dynamic Power Management Problem

We investigate the power-down mechanism which decides when a machine tra...
research
04/23/2022

Smoothed Online Combinatorial Optimization Using Imperfect Predictions

Smoothed online combinatorial optimization considers a learner who repea...
research
04/15/2016

Real-Time Contingency Analysis with Corrective Transmission Switching - Part II: Results and Discussion

This paper presents the performance of an AC transmission switching (TS)...
research
12/26/2018

Dynamic Online Gradient Descent with Improved Query Complexity: A Theoretical Revisit

We provide a new theoretical analysis framework to investigate online gr...
research
05/30/2019

Provably Efficient Q-Learning with Low Switching Cost

We take initial steps in studying PAC-MDP algorithms with limited adapti...

Please sign up or login with your details

Forgot password? Click here to reset