Online and Bandit Algorithms for Nonstationary Stochastic Saddle-Point Optimization

12/03/2019
by Abhishek Roy, et al.

Saddle-point optimization problems are an important class of optimization problems with applications to game theory, multi-agent reinforcement learning, and machine learning. Most of the rich literature on saddle-point optimization has focused on the offline setting. In this paper, we study nonstationary versions of stochastic, smooth, strongly convex and strongly concave saddle-point optimization problems, in both the online (or first-order) and multi-point bandit (or zeroth-order) settings. We first propose natural notions of regret for such nonstationary saddle-point optimization problems. We then analyze extragradient and Frank-Wolfe algorithms, for the unconstrained and constrained settings respectively, for this class of problems. We establish sublinear regret bounds on the proposed notions of regret in both the online and bandit settings.
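
To make the unconstrained online setting concrete, the following is a minimal sketch of an extragradient update tracking a drifting saddle point. It is an illustration only, not the algorithm or analysis from the paper: the toy objective f_t(x, y) = 0.5(x - a_t)^2 + xy - 0.5(y - b_t)^2, the drift schedule, the step size `eta`, and all function names are assumptions chosen for the example, and exact gradients are used where the paper considers stochastic ones.

```python
import math

def grad(x, y, a, b):
    """Partial derivatives of the assumed toy objective
    f(x, y) = 0.5*(x - a)**2 + x*y - 0.5*(y - b)**2,
    which is strongly convex in x and strongly concave in y."""
    return (x - a) + y, x - (y - b)

def saddle(a, b):
    """Closed-form saddle point of the toy objective (both partials vanish)."""
    return (a - b) / 2.0, (a + b) / 2.0

def extragradient_step(x, y, a, b, eta):
    """One extragradient step: extrapolate to a midpoint, then update
    the original iterate using the gradient at that midpoint."""
    gx, gy = grad(x, y, a, b)
    x_half, y_half = x - eta * gx, y + eta * gy      # descent in x, ascent in y
    gx, gy = grad(x_half, y_half, a, b)
    return x - eta * gx, y + eta * gy

def run(T=200, eta=0.2, drift=0.0):
    """Track the saddle point of f_t while (a_t, b_t) drifts slowly.
    Returns the distance to the current saddle point after each round."""
    x, y = 0.0, 0.0
    errors = []
    for t in range(T):
        a = 1.0 + drift * math.sin(0.05 * t)   # hypothetical drift schedule
        b = -1.0 + drift * math.cos(0.05 * t)
        x, y = extragradient_step(x, y, a, b, eta)
        xs, ys = saddle(a, b)
        errors.append(math.hypot(x - xs, y - ys))
    return errors

stationary = run(drift=0.0)
drifting = run(drift=0.5)
```

With `drift=0.0` the iterates converge to the fixed saddle point; with a slowly drifting saddle point the tracking error stays bounded instead of vanishing, which is the kind of behavior a nonstationary regret notion is meant to measure.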

research
07/31/2019

Multi-Point Bandit Algorithms for Nonstationary Online Nonconvex Optimization

Bandit algorithms have been predominantly analyzed in the convex setting...
research
03/09/2023

Gauges and Accelerated Optimization over Smooth and/or Strongly Convex Sets

We consider feasibility and constrained optimization problems defined ov...
research
10/21/2020

Efficient Projection-Free Algorithms for Saddle Point Problems

The Frank-Wolfe algorithm is a classic method for constrained optimizati...
research
10/14/2020

Boosting One-Point Derivative-Free Online Optimization via Residual Feedback

Zeroth-order optimization (ZO) typically relies on two-point feedback to...
research
06/09/2023

A Central Limit Theorem for Stochastic Saddle Point Optimization

In this work, we study the Uncertainty Quantification (UQ) of an algorit...
research
02/08/2023

PASTA: Pessimistic Assortment Optimization

We consider a class of assortment optimization problems in an offline da...
research
09/11/2012

On the Complexity of Bandit and Derivative-Free Stochastic Convex Optimization

The problem of stochastic convex optimization with bandit feedback (in t...
