Universal Algorithms: Beyond the Simplex

04/03/2020
by   Daron Anderson, et al.
0

The bulk of universal algorithms in the online convex optimisation literature are variants of the Hedge (exponential weights) algorithm on the simplex. While these algorithms extend to polytope domains by assigning weights to the vertices, this process is computationally unfeasible for many important classes of polytopes where the number V of vertices depends exponentially on the dimension d. In this paper we show the Subgradient algorithm is universal, meaning it has O(√(N)) regret in the antagonistic setting and O(1) pseudo-regret in the i.i.d setting, with two main advantages over Hedge: (1) The update step is more efficient as the action vectors have length only d rather than V; and (2) Subgradient gives better performance if the cost vectors satisfy Euclidean rather than sup-norm bounds. This paper extends the authors' recent results for Subgradient on the simplex. We also prove the same O(√(N)) and O(1) bounds when the domain is the unit ball. To the authors' knowledge this is the first instance of these bounds on a domain other than a polytope.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/10/2019

Optimality of the Subgradient Algorithm in the Stochastic Setting

Recently Jaouad Mourtada and Stéphane Gaïffas showed the anytime hedge a...
research
03/10/2021

Linear Bandits on Uniformly Convex Sets

Linear bandit algorithms yield 𝒪̃(n√(T)) pseudo-regret bounds on compact...
research
02/13/2023

Universal Online Optimization in Dynamic Environments via Uniclass Prediction

Recently, several universal methods have been proposed for online convex...
research
06/26/2019

Dual Adaptivity: A Universal Algorithm for Minimizing the Adaptive Regret of Convex Functions

To deal with changing environments, a new performance measure---adaptive...
research
10/06/2020

Online Linear Optimization with Many Hints

We study an online linear optimization (OLO) problem in which the learne...
research
05/10/2022

Universal Caching

In the learning literature, the performance of an online policy is commo...

Please sign up or login with your details

Forgot password? Click here to reset