Multi-Agent Reinforcement Learning for Long-Term Network Resource Allocation through Auction: a V2X Application

07/29/2022
by   Jing Tan, et al.
0

We formulate offloading of computational tasks from a dynamic group of mobile agents (e.g., cars) as decentralized decision making among autonomous agents. We design an interaction mechanism that incentivizes such agents to align private and system goals by balancing between competition and cooperation. In the static case, the mechanism provably has Nash equilibria with optimal resource allocation. In a dynamic environment, this mechanism's requirement of complete information is impossible to achieve. For such environments, we propose a novel multi-agent online learning algorithm that learns with partial, delayed and noisy state information, thus greatly reducing information need. Our algorithm is also capable of learning from long-term and sparse reward signals with varying delay. Empirical results from the simulation of a V2X application confirm that through learning, agents with the learning algorithm significantly improve both system and individual performance, reducing up to 30 increasing computation resource utilization and fairness. Results also confirm the algorithm's good convergence and generalization property in different environments.

READ FULL TEXT
research
04/05/2022

Multi-Agent Distributed Reinforcement Learning for Making Decentralized Offloading Decisions

We formulate computation offloading as a decentralized decision-making p...
research
04/05/2022

Learning to Bid Long-Term: Multi-Agent Reinforcement Learning with Long-Term and Sparse Reward in Repeated Auction Games

We propose a multi-agent distributed reinforcement learning algorithm th...
research
05/27/2022

Deep Reinforcement Learning for Distributed and Uncoordinated Cognitive Radios Resource Allocation

This paper presents a novel deep reinforcement learning-based resource a...
research
04/22/2020

Proactive Aging Mitigation in CGRAs through Utilization-Aware Allocation

Resource balancing has been effectively used to mitigate the long-term a...
research
05/05/2021

Fair and Truthful Mechanism with Limited Subsidy

The notion of envy-freeness is a natural and intuitive fairness requirem...
research
12/15/2020

Fast-Convergent Dynamics for Distributed Resource Allocation Over Sparse Time-Varying Networks

In this paper, distributed dynamics are deployed to solve resource alloc...
research
10/21/2020

Coordinated Online Learning for Multi-Agent Systems with Coupled Constraints and Perturbed Utility Observations

Competitive non-cooperative online decision-making agents whose actions ...

Please sign up or login with your details

Forgot password? Click here to reset