QGNN: Value Function Factorisation with Graph Neural Networks

05/25/2022
by   Ryan Kortvelesy, et al.
11

In multi-agent reinforcement learning, the use of a global objective is a powerful tool for incentivising cooperation. Unfortunately, it is not sample-efficient to train individual agents with a global reward, because it does not necessarily correlate with an agent's individual actions. This problem can be solved by factorising the global value function into local value functions. Early work in this domain performed factorisation by conditioning local value functions purely on local information. Recently, it has been shown that providing both local information and an encoding of the global state can promote cooperative behaviour. In this paper we propose QGNN, the first value factorisation method to use a graph neural network (GNN) based model. The multi-layer message passing architecture of QGNN provides more representational complexity than models in prior work, allowing it to produce a more effective factorisation. QGNN also introduces a permutation invariant mixer which is able to match the performance of other methods, even with significantly fewer parameters. We evaluate our method against several baselines, including QMIX-Att, GraphMIX, QMIX, VDN, and hybrid architectures. Our experiments include Starcraft, the standard benchmark for credit assignment; Estimate Game, a custom environment that explicitly models inter-agent dependencies; and Coalition Structure Generation, a foundational problem with real-world applications. The results show that QGNN outperforms state-of-the-art value factorisation baselines consistently.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/01/2021

Shapley Counterfactual Credits for Multi-Agent Reinforcement Learning

Centralized Training with Decentralized Execution (CTDE) has been a popu...
research
10/09/2020

Graph Convolutional Value Decomposition in Multi-Agent Reinforcement Learning

We propose a novel framework for value function factorization in multi-a...
research
09/27/2019

Deep Coordination Graphs

This paper introduces the deep coordination graph (DCG) for collaborativ...
research
12/24/2020

Cooperative Policy Learning with Pre-trained Heterogeneous Observation Representations

Multi-agent reinforcement learning (MARL) has been increasingly explored...
research
04/26/2023

NA^2Q: Neural Attention Additive Model for Interpretable Multi-Agent Q-Learning

Value decomposition is widely used in cooperative multi-agent reinforcem...
research
04/23/2022

Graph Neural Network based Agent in Google Research Football

Deep neural networks (DNN) can approximate value functions or policies f...
research
08/17/2022

Metric Residual Networks for Sample Efficient Goal-Conditioned Reinforcement Learning

Goal-conditioned reinforcement learning (GCRL) has a wide range of poten...

Please sign up or login with your details

Forgot password? Click here to reset