Pseudonorm Approachability and Applications to Regret Minimization

02/03/2023
by   Christoph Dann, et al.
0

Blackwell's celebrated approachability theory provides a general framework for a variety of learning problems, including regret minimization. However, Blackwell's proof and implicit algorithm measure approachability using the ℓ_2 (Euclidean) distance. We argue that in many applications such as regret minimization, it is more useful to study approachability under other distance metrics, most commonly the ℓ_∞-metric. But, the time and space complexity of the algorithms designed for ℓ_∞-approachability depend on the dimension of the space of the vectorial payoffs, which is often prohibitively large. Thus, we present a framework for converting high-dimensional ℓ_∞-approachability problems to low-dimensional pseudonorm approachability problems, thereby resolving such issues. We first show that the ℓ_∞-distance between the average payoff and the approachability set can be equivalently defined as a pseudodistance between a lower-dimensional average vector payoff and a new convex set we define. Next, we develop an algorithmic theory of pseudonorm approachability, analogous to previous work on approachability for ℓ_2 and other norms, showing that it can be achieved via online linear optimization (OLO) over a convex set given by the Fenchel dual of the unit pseudonorm ball. We then use that to show, modulo mild normalization assumptions, that there exists an ℓ_∞-approachability algorithm whose convergence is independent of the dimension of the original vectorial payoff. We further show that that algorithm admits a polynomial-time complexity, assuming that the original ℓ_∞-distance can be computed efficiently. We also give an ℓ_∞-approachability algorithm whose convergence is logarithmic in that dimension using an FTRL algorithm with a maximum-entropy regularizer.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/08/2020

Refined approachability algorithms and application to regret minimization with global costs

Blackwell's approachability is a framework where two players, the Decisi...
research
03/24/2013

Efficient Reinforcement Learning for High Dimensional Linear Quadratic Systems

We study the problem of adaptive control of a high dimensional linear qu...
research
07/21/2023

An Efficient Interior-Point Method for Online Convex Optimization

A new algorithm for regret minimization in online convex optimization is...
research
02/09/2023

Projection-free Online Exp-concave Optimization

We consider the setting of online convex optimization (OCO) with exp-con...
research
05/15/2023

Convex optimization over a probability simplex

We propose a new iteration scheme, the Cauchy-Simplex, to optimize conve...
research
08/08/2017

Time-Space Tradeoffs for Learning from Small Test Spaces: Learning Low Degree Polynomial Functions

We develop an extension of recently developed methods for obtaining time...

Please sign up or login with your details

Forgot password? Click here to reset