Policy Distillation and Value Matching in Multiagent Reinforcement Learning

03/15/2019
by   Samir Wadhwania, et al.
0

Multiagent reinforcement learning algorithms (MARL) have been demonstrated on complex tasks that require the coordination of a team of multiple agents to complete. Existing works have focused on sharing information between agents via centralized critics to stabilize learning or through communication to increase performance, but do not generally look at how information can be shared between agents to address the curse of dimensionality in MARL. We posit that a multiagent problem can be decomposed into a multi-task problem where each agent explores a subset of the state space instead of exploring the entire state space. This paper introduces a multiagent actor-critic algorithm and method for combining knowledge from homogeneous agents through distillation and value-matching that outperforms policy distillation alone and allows further learning in both discrete and continuous action spaces.

READ FULL TEXT
research
02/18/2022

Communication-Efficient Actor-Critic Methods for Homogeneous Markov Games

Recent success in cooperative multi-agent reinforcement learning (MARL) ...
research
11/26/2019

Dynamic Portfolio Management with Reinforcement Learning

Dynamic Portfolio Management is a domain that concerns the continuous re...
research
04/14/2021

Decomposed Soft Actor-Critic Method for Cooperative Multi-Agent Reinforcement Learning

Deep reinforcement learning methods have shown great performance on many...
research
10/23/2021

Fully Distributed Actor-Critic Architecture for Multitask Deep Reinforcement Learning

We propose a fully distributed actor-critic architecture, named Diff-DAC...
research
02/16/2018

Learning multiagent coordination in the absence of communication channels

In this work, we develop a reinforcement learning protocol for a multiag...
research
02/06/2019

Distilling Policy Distillation

The transfer of knowledge from one policy to another is an important too...
research
01/30/2023

Planning Multiple Epidemic Interventions with Reinforcement Learning

Combating an epidemic entails finding a plan that describes when and how...

Please sign up or login with your details

Forgot password? Click here to reset