Value-Decomposition Multi-Agent Actor-Critics

07/24/2020
by Jianyu Su, et al.

Exploiting extra state information has been an active research area in multi-agent reinforcement learning (MARL). QMIX represents the joint action-value using a non-negative function approximator and achieves by far the best performance on the multi-agent benchmark of StarCraft II micromanagement tasks. However, our experiments show that, in some cases, QMIX is incompatible with A2C, a training paradigm that improves training efficiency. To obtain a reasonable trade-off between training efficiency and algorithm performance, we extend value decomposition to actor-critics that are compatible with A2C and propose a novel actor-critic framework, value-decomposition actor-critics (VDACs). We evaluate VDACs on StarCraft II micromanagement tasks and demonstrate that the proposed framework improves median performance over other actor-critic methods. Furthermore, we use a set of ablation experiments to identify the key factors that contribute to the performance of VDACs.
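
To make the idea of a non-negative value decomposition concrete, here is a minimal sketch in Python. It assumes a plain linear mixer over per-agent values; the abstract does not specify the mixing architecture, so the function names `mix_values` and `additive_values`, the `abs()` trick for enforcing non-negative weights, and the additive special case are illustrative assumptions rather than the authors' implementation.

```python
from typing import Sequence


def mix_values(local_values: Sequence[float],
               raw_weights: Sequence[float],
               bias: float = 0.0) -> float:
    """Combine per-agent values into one joint value (hypothetical helper).

    Assumption: a linear mixer. Real mixers may be deeper networks.
    """
    if len(local_values) != len(raw_weights):
        raise ValueError("need exactly one weight per agent")
    # abs() forces every mixing weight to be >= 0, so the joint value
    # never decreases when any single agent's value increases.
    weights = [abs(w) for w in raw_weights]
    return sum(w * v for w, v in zip(weights, local_values)) + bias


def additive_values(local_values: Sequence[float]) -> float:
    """Special case with all weights equal to 1: the joint value is a plain sum."""
    return mix_values(local_values, [1.0] * len(local_values))


if __name__ == "__main__":
    values = [1.0, 2.0]
    raw = [0.5, -2.0]  # the negative raw weight is mapped to 2.0
    print(mix_values(values, raw))
    print(additive_values(values))
```

The design point is the monotonicity constraint: because every weight is non-negative, improving one agent's local value can only keep or raise the joint value, which is what lets per-agent learning signals stay consistent with the team objective.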


