Leveraging World Model Disentanglement in Value-Based Multi-Agent Reinforcement Learning

09/08/2023
by   Zhizun Wang, et al.
0

In this paper, we propose a novel model-based multi-agent reinforcement learning approach named Value Decomposition Framework with Disentangled World Model to address the challenge of achieving a common goal of multiple agents interacting in the same environment with reduced sample complexity. Due to scalability and non-stationarity problems posed by multi-agent systems, model-free methods rely on a considerable number of samples for training. In contrast, we use a modularized world model, composed of action-conditioned, action-free, and static branches, to unravel the environment dynamics and produce imagined outcomes based on past experience, without sampling directly from the real environment. We employ variational auto-encoders and variational graph auto-encoders to learn the latent representations for the world model, which is merged with a value-based framework to predict the joint action-value function and optimize the overall training objective. We present experimental results in Easy, Hard, and Super-Hard StarCraft II micro-management challenges to demonstrate that our method achieves high sample efficiency and exhibits superior performance in defeating the enemy armies compared to other baselines.

READ FULL TEXT

page 3

page 6

research
04/20/2022

Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning

Recently, model-based agents have achieved better performance than model...
research
05/12/2023

Boosting Value Decomposition via Unit-Wise Attentive State Representation for Cooperative Multi-Agent Reinforcement Learning

In cooperative multi-agent reinforcement learning (MARL), the environmen...
research
04/12/2023

Bi-level Latent Variable Model for Sample-Efficient Multi-Agent Reinforcement Learning

Despite their potential in real-world applications, multi-agent reinforc...
research
03/16/2023

SVDE: Scalable Value-Decomposition Exploration for Cooperative Multi-Agent Reinforcement Learning

Value-decomposition methods, which reduce the difficulty of a multi-agen...
research
06/22/2022

PAC: Assisted Value Factorisation with Counterfactual Predictions in Multi-Agent Reinforcement Learning

Multi-agent reinforcement learning (MARL) has witnessed significant prog...
research
04/03/2023

Effective and Stable Role-Based Multi-Agent Collaboration by Structural Information Principles

Role-based learning is a promising approach to improving the performance...
research
12/30/2021

Multi-Agent Reinforcement Learning via Adaptive Kalman Temporal Difference and Successor Representation

Distributed Multi-Agent Reinforcement Learning (MARL) algorithms has att...

Please sign up or login with your details

Forgot password? Click here to reset