Smoothed Online Convex Optimization Based on Discounted-Normal-Predictor

05/02/2022
by   Moshe Y. Vardi, et al.
0

In this paper, we investigate an online prediction strategy named as Discounted-Normal-Predictor (Kapralov and Panigrahy, 2010) for smoothed online convex optimization (SOCO), in which the learner needs to minimize not only the hitting cost but also the switching cost. In the setting of learning with expert advice, Daniely and Mansour (2019) demonstrate that Discounted-Normal-Predictor can be utilized to yield nearly optimal regret bounds over any interval, even in the presence of switching costs. Inspired by their results, we develop a simple algorithm for SOCO: Combining online gradient descent (OGD) with different step sizes sequentially by Discounted-Normal-Predictor. Despite its simplicity, we prove that it is able to minimize the adaptive regret with switching cost, i.e., attaining nearly optimal regret with switching cost on every interval. By exploiting the theoretical guarantee of OGD for dynamic regret, we further show that the proposed algorithm can minimize the dynamic regret with switching cost in every interval.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/13/2021

Revisiting Smoothed Online Learning

In this paper, we revisit the problem of smoothed online learning, in wh...
research
04/07/2019

Competitive ratio versus regret minimization: achieving the best of both worlds

We consider online algorithms under both the competitive ratio criteria ...
research
11/25/2020

Leveraging Predictions in Smoothed Online Convex Optimization via Gradient-based Algorithms

We consider online convex optimization with time-varying stage costs and...
research
05/13/2022

Optimal Parameter-free Online Learning with Switching Cost

Parameter-freeness in online learning refers to the adaptivity of an alg...
research
11/09/2022

A Note on Task-Aware Loss via Reweighing Prediction Loss by Decision-Regret

In this short technical note we propose a baseline for decision-aware le...
research
04/23/2022

Smoothed Online Combinatorial Optimization Using Imperfect Predictions

Smoothed online combinatorial optimization considers a learner who repea...
research
10/29/2021

Online Optimization with Feedback Delay and Nonlinear Switching Cost

We study a variant of online optimization in which the learner receives ...

Please sign up or login with your details

Forgot password? Click here to reset