Online mirror descent and dual averaging: keeping pace in the dynamic case

06/03/2020
by   Huang Fang, et al.
0

Online mirror descent (OMD) and dual averaging (DA) are two fundamental algorithms for online convex optimization. They are known to have very similar (or even identical) performance guarantees in most scenarios when a fixed learning rate is used. However, for dynamic learning rates OMD is provably inferior to DA. It is known that, with a dynamic learning rate, OMD can suffer linear regret, even in common settings such as prediction with expert advice. This hints that the relationship between OMD and DA is not fully understood at present. In this paper, we modify the OMD algorithm by a simple technique that we call stabilization. We give essentially the same abstract regret bound for stabilized OMD and DA by modifying the classical OMD convergence analysis in a careful and modular way, yielding proofs that we believe to be clean and flexible. Simple corollaries of these bounds show that OMD with stabilization and DA enjoy the same performance guarantees in many applications even under dynamic learning rates. We also shed some light on the similarities between OMD and DA and show simple conditions under which stabilized OMD and DA generate the same iterates.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/30/2019

Unifying mirror descent and dual averaging

We introduce and analyse a new family of algorithms which generalizes an...
research
04/02/2016

Online Updating of Word Representations for Part-of-Speech Tagging

We propose online unsupervised domain adaptation (DA), which is performe...
research
05/22/2019

Convergence Analyses of Online ADAM Algorithm in Convex Setting and Two-Layer ReLU Neural Network

Nowadays, online learning is an appealing learning paradigm, which is of...
research
02/28/2019

A Dynamic Model Identification Package for the da Vinci Research Kit

The da Vinci Research Kit (dVRK) is a teleoperated surgical robotic syst...
research
01/31/2020

Dynamic and Distributed Online Convex Optimization for Demand Response of Commercial Buildings

We extend the regret analysis of the online distributed weighted dual av...
research
11/21/2020

An elastic framework for ensemble-based large-scale data assimilation

Prediction of chaotic systems relies on a floating fusion of sensor data...

Please sign up or login with your details

Forgot password? Click here to reset