Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning

04/20/2022
by   Zhiwei Xu, et al.
0

Recently, model-based agents have achieved better performance than model-free ones using the same computational budget and training time in single-agent environments. However, due to the complexity of multi-agent systems, it is tough to learn the model of the environment. The significant compounding error may hinder the learning process when model-based methods are applied to multi-agent tasks. This paper proposes an implicit model-based multi-agent reinforcement learning method based on value decomposition methods. Under this method, agents can interact with the learned virtual environment and evaluate the current state value according to imagined future states in the latent space, making agents have the foresight. Our approach can be applied to any multi-agent value decomposition method. The experimental results show that our method improves the sample efficiency in different partially observable Markov decision process domains.

READ FULL TEXT
research
06/11/2020

Multi-Agent Informational Learning Processes

We introduce a new mathematical model of multi-agent reinforcement learn...
research
03/20/2022

Model-based Multi-agent Reinforcement Learning: Recent Progress and Prospects

Significant advances have recently been achieved in Multi-Agent Reinforc...
research
01/15/2020

Model-based Multi-Agent Reinforcement Learning with Cooperative Prioritized Sweeping

We present a new model-based reinforcement learning algorithm, Cooperati...
research
01/29/2019

Multi Agent Reinforcement Learning with Multi-Step Generative Models

The dynamics between agents and the environment are an important compone...
research
09/08/2023

Leveraging World Model Disentanglement in Value-Based Multi-Agent Reinforcement Learning

In this paper, we propose a novel model-based multi-agent reinforcement ...
research
05/13/2021

SIDE: I Infer the State I Want to Learn

As one of the solutions to the Dec-POMDP problem, the value decompositio...
research
03/06/2018

Intent-aware Multi-agent Reinforcement Learning

This paper proposes an intent-aware multi-agent planning framework as we...

Please sign up or login with your details

Forgot password? Click here to reset