Coordinated Online Learning for Multi-Agent Systems with Coupled Constraints and Perturbed Utility Observations

10/21/2020
by   Ezra Tampubolon, et al.
0

Competitive non-cooperative online decision-making agents whose actions increase congestion of scarce resources constitute a model for widespread modern large-scale applications. To ensure sustainable resource behavior, we introduce a novel method to steer the agents toward a stable population state, fulfilling the given coupled resource constraints. The proposed method is a decentralized resource pricing method based on the resource loads resulting from the augmentation of the game's Lagrangian. Assuming that the online learning agents have only noisy first-order utility feedback, we show that for a polynomially decaying agents' step size/learning rate, the population's dynamic will almost surely converge to generalized Nash equilibrium. A particular consequence of the latter is the fulfillment of resource constraints in the asymptotic limit. Moreover, we investigate the finite-time quality of the proposed algorithm by giving a nonasymptotic time decaying bound for the expected amount of resource constraint violation.

READ FULL TEXT

page 1

page 13

page 16

research
10/21/2019

Pricing Mechanism for Resource Sustainability in Competitive Online Learning Multi-Agent Systems

In this paper, we consider the problem of resource congestion control fo...
research
10/21/2019

Semi-Decentralized Coordinated Online Learning for Continuous Games with Coupled Constraints via Augmented Lagrangian

We consider a class of concave continuous games in which the correspondi...
research
02/14/2020

Resource-Aware Control via Dynamic Pricing for Congestion Game with Finite-Time Guarantees

Congestion game is a widely used model for modern networked applications...
research
06/19/2023

Taming the Exponential Action Set: Sublinear Regret and Fast Convergence to Nash Equilibrium in Online Congestion Games

The congestion game is a powerful model that encompasses a range of engi...
research
10/21/2019

Robust Online Learning for Resource Allocation – Beyond Euclidean Projection and Dynamic Fit

Online-learning literature has focused on designing algorithms that ensu...
research
07/29/2022

Multi-Agent Reinforcement Learning for Long-Term Network Resource Allocation through Auction: a V2X Application

We formulate offloading of computational tasks from a dynamic group of m...
research
12/29/2021

Socially-Optimal Mechanism Design for Incentivized Online Learning

Multi-arm bandit (MAB) is a classic online learning framework that studi...

Please sign up or login with your details

Forgot password? Click here to reset