Online Multiobjective Minimax Optimization and Applications

08/09/2021
by   Georgy Noarov, et al.
0

We introduce a simple but general online learning framework, in which at every round, an adaptive adversary introduces a new game, consisting of an action space for the learner, an action space for the adversary, and a vector valued objective function that is convex-concave in every coordinate. The learner and the adversary then play in this game. The learner's goal is to play so as to minimize the maximum coordinate of the cumulative vector-valued loss. The resulting one-shot game is not convex-concave, and so the minimax theorem does not apply. Nevertheless, we give a simple algorithm that can compete with the setting in which the adversary must announce their action first, with optimally diminishing regret. We demonstrate the power of our simple framework by using it to derive optimal bounds and algorithms across a variety of domains. This includes no regret learning: we can recover optimal algorithms and bounds for minimizing external regret, internal regret, adaptive regret, multigroup regret, subsequence regret, and a notion of regret in the sleeping experts setting. Next, we use it to derive a variant of Blackwell's Approachability Theorem, which we term "Fast Polytope Approachability". Finally, we are able to recover recently derived algorithms and bounds for online adversarial multicalibration and related notions (mean-conditioned moment multicalibration, and prediction interval multivalidity).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/18/2022

Online Convex Optimization with Unbounded Memory

Online convex optimization (OCO) is a widely used framework in online le...
research
01/23/2021

Optimistic and Adaptive Lagrangian Hedging

In online learning an algorithm plays against an environment with losses...
research
07/26/2023

Corruption-Robust Lipschitz Contextual Search

I study the problem of learning a Lipschitz function with corrupted bina...
research
11/09/2018

Policy Regret in Repeated Games

The notion of policy regret in online learning is a well defined? perfor...
research
10/31/2020

Prediction against limited adversary

We study the problem of prediction with expert advice with adversarial c...
research
02/23/2021

Learner-Private Online Convex Optimization

Online convex optimization is a framework where a learner sequentially q...
research
11/14/2010

Online Learning: Beyond Regret

We study online learnability of a wide class of problems, extending the ...

Please sign up or login with your details

Forgot password? Click here to reset