Constrained Online Two-stage Stochastic Optimization: New Algorithms via Adversarial Learning

02/02/2023
by   Jiashuo Jiang, et al.
0

We consider an online two-stage stochastic optimization with long-term constraints over a finite horizon of T periods. At each period, we take the first-stage action, observe a model parameter realization and then take the second-stage action from a feasible set that depends both on the first-stage decision and the model parameter. We aim to minimize the cumulative objective value while guaranteeing that the long-term average second-stage decision belongs to a set. We propose a general algorithmic framework that derives online algorithms for the online two-stage problem from adversarial learning algorithms. Also, the regret bound of our algorithm cam be reduced to the regret bound of embedded adversarial learning algorithms. Based on our framework, we obtain new results under various settings. When the model parameter at each period is drawn from identical distributions, we derive state-of-art regret bound that improves previous bounds under special cases. Our algorithm is also robust to adversarial corruptions of model parameter realizations. When the model parameters are drawn from unknown non-stationary distributions and we are given prior estimates of the distributions, we develop a new algorithm from our framework with a regret O(W_T+√(T)), where W_T measures the total inaccuracy of the prior estimates.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/13/2020

Online Stochastic Optimization with Wasserstein Based Non-stationarity

We consider a general online stochastic optimization problem with multip...
research
05/18/2023

Online Resource Allocation in Episodic Markov Decision Processes

This paper studies a long-term resource allocation problem over multiple...
research
09/15/2022

A Unifying Framework for Online Optimization with Long-Term Constraints

We study online learning problems in which a decision maker has to take ...
research
05/05/2021

Two-Stage Stochastic Optimization via Primal-Dual Decomposition and Deep Unrolling

We consider a two-stage stochastic optimization problem, in which a long...
research
06/10/2020

Model-Free Algorithm and Regret Analysis for MDPs with Long-Term Constraints

In the optimization of dynamical systems, the variables typically have c...
research
02/17/2016

Online optimization and regret guarantees for non-additive long-term constraints

We consider online optimization in the 1-lookahead setting, where the ob...
research
07/19/2022

Regret Minimization with Noisy Observations

In a typical optimization problem, the task is to pick one of a number o...

Please sign up or login with your details

Forgot password? Click here to reset