A New Framework for Multi-Agent Reinforcement Learning – Centralized Training and Exploration with Decentralized Execution via Policy Distillation

10/21/2019
by   Gang Chen, et al.
0

Deep reinforcement learning (DRL) is a booming area of artificial intelligence. Many practical applications of DRL naturally involve more than one collaborative learners, making it important to study DRL in a multi-agent context. Previous research showed that effective learning in complex multi-agent systems demands for highly coordinated environment exploration among all the participating agents. Many researchers attempted to cope with this challenge through learning centralized value functions. However, the common strategy for every agent to learn their local policies directly often fail to nurture strong inter-agent collaboration and can be sample inefficient whenever agents alter their communication channels. To address these issues, we propose a new framework known as centralized training and exploration with decentralized execution via policy distillation. Guided by this framework and the maximum-entropy learning technique, we will first train agents' policies with shared global component to foster coordinated and effective learning. Locally executable policies will be derived subsequently from the trained global policies via policy distillation. Experiments show that our new framework and algorithm can achieve significantly better performance and higher sample efficiency than a cutting-edge baseline on several multi-agent DRL benchmarks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/27/2023

Is Centralized Training with Decentralized Execution Framework Centralized Enough for MARL?

Centralized Training with Decentralized Execution (CTDE) has recently em...
research
01/06/2023

Centralized Cooperative Exploration Policy for Continuous Control Tasks

The deep reinforcement learning (DRL) algorithm works brilliantly on sol...
research
09/29/2022

Hierarchical Training of Deep Ensemble Policies for Reinforcement Learning in Continuous Spaces

Many actor-critic deep reinforcement learning (DRL) algorithms have achi...
research
09/30/2021

A Privacy-preserving Distributed Training Framework for Cooperative Multi-agent Deep Reinforcement Learning

Deep Reinforcement Learning (DRL) sometimes needs a large amount of data...
research
06/12/2020

Human and Multi-Agent collaboration in a human-MARL teaming framework

Collaborative multi-agent reinforcement learning (MARL) as a specific ca...
research
09/19/2019

Multi-Robot Deep Reinforcement Learning with Macro-Actions

In many real-world multi-robot tasks, high-quality solutions often requi...
research
05/10/2023

Fast Teammate Adaptation in the Presence of Sudden Policy Change

In cooperative multi-agent reinforcement learning (MARL), where an agent...

Please sign up or login with your details

Forgot password? Click here to reset