Beyond Online Balanced Descent: An Optimal Algorithm for Smoothed Online Optimization

by Gautam Goel, et al.

We study online convex optimization in a setting where the learner seeks to minimize the sum of a per-round hitting cost and a movement cost that is incurred when changing decisions between rounds. We prove a new lower bound on the competitive ratio of any online algorithm in the setting where the hitting costs are m-strongly convex and the movement costs are the squared ℓ_2 norm. This lower bound shows that no algorithm can achieve a competitive ratio that is o(m^{-1/2}) as m tends to zero. No existing algorithm has a competitive ratio matching this bound, and we show that the state-of-the-art algorithm, Online Balanced Descent (OBD), has a competitive ratio that is Ω(m^{-2/3}). We additionally propose two new algorithms, Greedy OBD (G-OBD) and Regularized OBD (R-OBD), and prove that both have an O(m^{-1/2}) competitive ratio. The result for G-OBD holds when the hitting costs are quasiconvex and the movement costs are the squared ℓ_2 norm, while the result for R-OBD holds when the hitting costs are m-strongly convex and the movement costs are Bregman divergences. Further, we show that R-OBD simultaneously achieves a constant, dimension-free competitive ratio and sublinear regret when the hitting costs are strongly convex.
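To make the cost model concrete, the following is a minimal sketch of how the total cost of a decision sequence could be computed in this setting. It is an illustration only, not G-OBD, R-OBD, or any algorithm from the paper. The quadratic hitting cost f_t(x) = (m/2)||x - v_t||^2, the factor of 1/2 on the movement cost, and all function names are assumptions chosen for the example. The `one_step_baseline` function is a simple hypothetical rule that minimizes the current round's hitting cost plus movement cost in closed form.

```python
from typing import List, Sequence

Vector = Sequence[float]


def sq_dist(a: Vector, b: Vector) -> float:
    """Squared Euclidean (l2) distance between two points."""
    return sum((ai - bi) ** 2 for ai, bi in zip(a, b))


def hitting_cost(x: Vector, v: Vector, m: float) -> float:
    """Assumed m-strongly convex hitting cost: (m/2) * ||x - v||^2."""
    return 0.5 * m * sq_dist(x, v)


def movement_cost(x: Vector, x_prev: Vector) -> float:
    """Assumed movement cost: (1/2) * ||x - x_prev||^2."""
    return 0.5 * sq_dist(x, x_prev)


def total_cost(xs: List[Vector], vs: List[Vector], m: float, x0: Vector) -> float:
    """Sum of per-round hitting and movement costs along the sequence xs."""
    cost = 0.0
    prev = x0
    for x, v in zip(xs, vs):
        cost += hitting_cost(x, v, m) + movement_cost(x, prev)
        prev = x
    return cost


def one_step_baseline(vs: List[Vector], m: float, x0: Vector) -> List[List[float]]:
    """Hypothetical baseline: each round, minimize hitting + movement cost.

    For the quadratic costs assumed above, the minimizer of
    (m/2)||x - v||^2 + (1/2)||x - x_prev||^2 is (m*v + x_prev) / (1 + m).
    """
    xs: List[List[float]] = []
    prev = list(x0)
    for v in vs:
        x = [(m * vi + pi) / (1.0 + m) for vi, pi in zip(v, prev)]
        xs.append(x)
        prev = x
    return xs
```

The competitive ratio of an online rule is the worst-case ratio of its `total_cost` to that of the best decision sequence chosen in hindsight. Computing that offline optimum is outside the scope of this sketch.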




