NA^2Q: Neural Attention Additive Model for Interpretable Multi-Agent Q-Learning

04/26/2023
by   Zichuan Liu, et al.
0

Value decomposition is widely used in cooperative multi-agent reinforcement learning, however, its implicit credit assignment mechanism is not yet fully understood due to black-box networks. In this work, we study an interpretable value decomposition framework via the family of generalized additive models. We present a novel method, named Neural Attention Additive Q-learning (NA^2Q), providing inherent intelligibility of collaboration behavior. NA^2Q can explicitly factorize the optimal joint policy induced by enriching shape functions to model all possible coalitions of agents into individual policies. Moreover, we construct identity semantics to promote estimating credits together with the global state and individual value functions, where local semantic masks help us diagnose whether each agent captures relevant-task information. Extensive experiments show that NA^2Q consistently achieves superior performance compared to different state-of-the-art methods on all challenging tasks, while yielding human-like interpretability.

READ FULL TEXT

page 6

page 7

page 16

page 19

page 20

research
07/21/2023

Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization

Offline reinforcement learning (RL) has received considerable attention ...
research
02/04/2023

Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative Multi-Agent Reinforcement Learning

Value decomposition methods have gradually become popular in the coopera...
research
02/10/2020

Qatten: A General Framework for Cooperative Multiagent Reinforcement Learning

In many real-world settings, a team of cooperative agents must learn to ...
research
11/23/2022

Contrastive Identity-Aware Learning for Multi-Agent Value Decomposition

Value Decomposition (VD) aims to deduce the contributions of agents for ...
research
08/25/2023

Nonparametric Additive Value Functions: Interpretable Reinforcement Learning with an Application to Surgical Recovery

We propose a nonparametric additive model for estimating interpretable v...
research
05/25/2022

QGNN: Value Function Factorisation with Graph Neural Networks

In multi-agent reinforcement learning, the use of a global objective is ...
research
12/29/2021

DeepHAM: A Global Solution Method for Heterogeneous Agent Models with Aggregate Shocks

We propose an efficient, reliable, and interpretable global solution met...

Please sign up or login with your details

Forgot password? Click here to reset