Online Saddle Point Problem with Applications to Constrained Online Convex Optimization

06/21/2018
by   Adrian Rivera, et al.
0

We study an online saddle point problem where at each iteration a pair of actions need to be chosen without knowledge of the future (convex-concave) payoff functions. The objective is to minimize the gap between the cumulative payoffs and the saddle point value of the aggregate payoff function, which we measure using a metric called "SP-regret". The problem generalizes the online convex optimization framework and can be interpreted as finding the Nash equilibrium for the aggregate of a sequence of two-player zero-sum games. We propose an algorithm that achieves Õ(√(T)) SP-regret in the general case, and O( T) SP-regret for the strongly convex-concave case. We then consider a constrained online convex optimization problem motivated by a variety of applications in dynamic pricing, auctions, and crowdsourcing. We relate this problem to an online saddle point problem and establish O(√(T)) regret using a primal-dual algorithm.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/29/2022

No-Regret Learning in Network Stochastic Zero-Sum Games

No-regret learning has been widely used to compute a Nash equilibrium in...
research
01/09/2020

On Distributed Online Convex Optimization with Sublinear Dynamic Regret and Fit

In this work, we consider a distributed online convex optimization probl...
research
07/21/2023

An Efficient Interior-Point Method for Online Convex Optimization

A new algorithm for regret minimization in online convex optimization is...
research
06/13/2018

Minimizing Regret in Bandit Online Optimization in Unconstrained and Constrained Action Spaces

We consider online convex optimization with zeroth-order feedback settin...
research
09/12/2021

Concave Utility Reinforcement Learning with Zero-Constraint Violations

We consider the problem of tabular infinite horizon concave utility rein...
research
05/23/2018

Efficient online algorithms for fast-rate regret bounds under sparsity

We consider the online convex optimization problem. In the setting of ar...
research
04/15/2020

Online Multiserver Convex Chasing and Optimization

We introduce the problem of k-chasing of convex functions, a simultaneou...

Please sign up or login with your details

Forgot password? Click here to reset