Scalable Primal-Dual Actor-Critic Method for Safe Multi-Agent RL with General Utilities

05/27/2023
by   Donghao Ying, et al.
0

We investigate safe multi-agent reinforcement learning, where agents seek to collectively maximize an aggregate sum of local objectives while satisfying their own safety constraints. The objective and constraints are described by general utilities, i.e., nonlinear functions of the long-term state-action occupancy measure, which encompass broader decision-making goals such as risk, exploration, or imitations. The exponential growth of the state-action space size with the number of agents presents challenges for global observability, further exacerbated by the global coupling arising from agents' safety constraints. To tackle this issue, we propose a primal-dual method utilizing shadow reward and κ-hop neighbor truncation under a form of correlation decay property, where κ is the communication radius. In the exact setting, our algorithm converges to a first-order stationary point (FOSP) at the rate of 𝒪(T^-2/3). In the sample-based setting, we demonstrate that, with high probability, our algorithm requires 𝒪(ϵ^-3.5) samples to achieve an ϵ-FOSP with an approximation error of 𝒪(ϕ_0^2κ), where ϕ_0∈ (0,1). Finally, we demonstrate the effectiveness of our model through extensive numerical experiments.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/15/2023

Scalable Multi-Agent Reinforcement Learning with General Utilities

We study the scalable multi-agent reinforcement learning (MARL) with gen...
research
12/05/2019

Scalable Reinforcement Learning of Localized Policies for Multi-Agent Networked Systems

We study reinforcement learning (RL) in a setting with a network of agen...
research
06/11/2020

Distributed Reinforcement Learning in Multi-Agent Networked Systems

We study distributed reinforcement learning (RL) for a network of agents...
research
05/29/2021

MARL with General Utilities via Decentralized Shadow Reward Actor-Critic

We posit a new mechanism for cooperation in multi-agent reinforcement le...
research
06/11/2020

Scalable Multi-Agent Reinforcement Learning for Networked Systems with Average Reward

It has long been recognized that multi-agent reinforcement learning (MAR...
research
12/23/2022

A learning-based approach to multi-agent decision-making

We propose a learning-based methodology to reconstruct private informati...
research
06/13/2023

Provably Learning Nash Policies in Constrained Markov Potential Games

Multi-agent reinforcement learning (MARL) addresses sequential decision-...

Please sign up or login with your details

Forgot password? Click here to reset