CESMA: Centralized Expert Supervises Multi-Agents

02/06/2019
by   Alex Tong Lin, et al.
22

We consider the reinforcement learning problem of training multiple agents in order to maximize a shared reward. In this multi-agent system, each agent seeks to maximize the reward while interacting with other agents, and they may or may not be able to communicate. Typically the agents do not have access to other agent policies and thus each agent observes a non-stationary and partially-observable environment. In order to resolve this issue, we demonstrate a novel multi-agent training framework that first turns a multi-agent problem into a single-agent problem to obtain a centralized expert that is then used to guide supervised learning for multiple independent agents with the goal of decentralizing the policy. We additionally demonstrate a way to turn the exponential growth in the joint action space into a linear growth for the centralized policy. Overall, the problem is twofold: the problem of obtaining a centralized expert, and then the problem of supervised learning to train the multi-agents. We demonstrate our solutions to both of these tasks, and show that supervised learning can be used to decentralize a multi-agent policy.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/05/2019

Exploration with Unreliable Intrinsic Reward in Multi-Agent Reinforcement Learning

This paper investigates the use of intrinsic reward to guide exploration...
research
06/01/2022

DM^2: Distributed Multi-Agent Reinforcement Learning for Distribution Matching

Current approaches to multi-agent cooperation rely heavily on centralize...
research
10/22/2020

Multi-agent active perception with prediction rewards

Multi-agent active perception is a task where a team of agents cooperati...
research
01/19/2022

Improving Behavioural Cloning with Human-Driven Dynamic Dataset Augmentation

Behavioural cloning has been extensively used to train agents and is rec...
research
05/31/2019

Attentional Policies for Cross-Context Multi-Agent Reinforcement Learning

Many potential applications of reinforcement learning in the real world ...
research
02/07/2023

MACOptions: Multi-Agent Learning with Centralized Controller and Options Framework

These days automation is being applied everywhere. In every environment,...
research
12/10/2018

Learning Sharing Behaviors with Arbitrary Numbers of Agents

We propose a method for modeling and learning turn-taking behaviors for ...

Please sign up or login with your details

Forgot password? Click here to reset