Robust Online Learning for Resource Allocation – Beyond Euclidean Projection and Dynamic Fit

10/21/2019
by   Ezra Tampubolon, et al.
0

Online-learning literature has focused on designing algorithms that ensure sub-linear growth of the cumulative long-term constraint violations. The drawback of this guarantee is that strictly feasible actions may cancel out constraint violations on other time slots. For this reason, we introduce a new performance measure called , whose particular instance is the cumulative positive part of the constraint violations. We propose a class of non-causal algorithms for online-decision making, which guarantees, in slowly changing environments, sub-linear growth of this quantity despite noisy first-order feedback. Furthermore, we demonstrate by numerical experiments the performance gain of our method relative to the state of art.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/19/2018

Online Convex Optimization for Cumulative Constraints

We propose an algorithm for online convex optimization which examines a ...
research
01/14/2017

An Online Convex Optimization Approach to Dynamic Network Resource Allocation

Existing approaches to online convex optimization (OCO) make sequential ...
research
05/29/2020

Online DR-Submodular Maximization with Stochastic Cumulative Constraints

In this paper, we consider online continuous DR-submodular maximization ...
research
10/21/2020

Coordinated Online Learning for Multi-Agent Systems with Coupled Constraints and Perturbed Utility Observations

Competitive non-cooperative online decision-making agents whose actions ...
research
10/30/2022

Online Convex Optimization with Long Term Constraints for Predictable Sequences

In this paper, we investigate the framework of Online Convex Optimizatio...
research
05/31/2023

Towards Fair Disentangled Online Learning for Changing Environments

In the problem of online learning for changing environments, data are se...
research
02/09/2022

Online Learning to Transport via the Minimal Selection Principle

Motivated by robust dynamic resource allocation in operations research, ...

Please sign up or login with your details

Forgot password? Click here to reset