Online optimization and regret guarantees for non-additive long-term constraints

02/17/2016
by   Rodolphe Jenatton, et al.
0

We consider online optimization in the 1-lookahead setting, where the objective does not decompose additively over the rounds of the online game. The resulting formulation enables us to deal with non-stationary and/or long-term constraints , which arise, for example, in online display advertising problems. We propose an on-line primal-dual algorithm for which we obtain dynamic cumulative regret guarantees. They depend on the convexity and the smoothness of the non-additive penalty, as well as terms capturing the smoothness with which the residuals of the non-stationary and long-term constraints vary over the rounds. We conduct experiments on synthetic data to illustrate the benefits of the non-additive penalty and show vanishing regret convergence on live traffic data collected by a display advertising platform in production.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/15/2022

A Unifying Framework for Online Optimization with Long-Term Constraints

We study online learning problems in which a decision maker has to take ...
research
07/10/2023

Online Ad Procurement in Non-stationary Autobidding Worlds

Today's online advertisers procure digital ad impressions through intera...
research
12/23/2015

Adaptive Algorithms for Online Convex Optimization with Long-term Constraints

We present an adaptive online gradient descent algorithm to solve online...
research
06/09/2021

Regret and Cumulative Constraint Violation Analysis for Online Convex Optimization with Long Term Constraints

This paper considers online convex optimization with long term constrain...
research
06/03/2020

Non-Stationary Bandits with Intermediate Observations

Online recommender systems often face long delays in receiving feedback,...
research
02/02/2023

Constrained Online Two-stage Stochastic Optimization: New Algorithms via Adversarial Learning

We consider an online two-stage stochastic optimization with long-term c...
research
06/24/2021

Improved Regret Bounds for Tracking Experts with Memory

We address the problem of sequential prediction with expert advice in a ...

Please sign up or login with your details

Forgot password? Click here to reset