Provably Efficient Cooperative Multi-Agent Reinforcement Learning with Function Approximation

03/08/2021
by   Abhimanyu Dubey, et al.
12

Reinforcement learning in cooperative multi-agent settings has recently advanced significantly in its scope, with applications in cooperative estimation for advertising, dynamic treatment regimes, distributed control, and federated learning. In this paper, we discuss the problem of cooperative multi-agent RL with function approximation, where a group of agents communicates with each other to jointly solve an episodic MDP. We demonstrate that via careful message-passing and cooperative value iteration, it is possible to achieve near-optimal no-regret learning even with a fixed constant communication budget. Next, we demonstrate that even in heterogeneous cooperative settings, it is possible to achieve Pareto-optimal no-regret learning with limited communication. Our work generalizes several ideas from the multi-agent contextual and multi-armed bandit literature to MDPs and reinforcement learning.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/08/2023

Cooperative Multi-agent Bandits: Distributed Algorithms with Optimal Individual Regret and Constant Communication Costs

Recently, there has been extensive study of cooperative multi-agent mult...
research
08/08/2023

Communication-Efficient Cooperative Multi-Agent PPO via Regulated Segment Mixture in Internet of Vehicles

Multi-Agent Reinforcement Learning (MARL) has become a classic paradigm ...
research
02/11/2022

The Shapley Value in Machine Learning

Over the last few years, the Shapley value, a solution concept from coop...
research
05/10/2023

Cooperative Multi-Agent Reinforcement Learning: Asynchronous Communication and Linear Function Approximation

We study multi-agent reinforcement learning in the setting of episodic M...
research
08/14/2020

Cooperative Multi-Agent Bandits with Heavy Tails

We study the heavy-tailed stochastic bandit problem in the cooperative m...
research
08/16/2020

The reinforcement learning-based multi-agent cooperative approach for the adaptive speed regulation on a metallurgical pickling line

We present a holistic data-driven approach to the problem of productivit...
research
06/01/2020

A novel approach for multi-agent cooperative pursuit to capture grouped evaders

An approach of mobile multi-agent pursuit based on application of self-o...

Please sign up or login with your details

Forgot password? Click here to reset