Causality Detection for Efficient Multi-Agent Reinforcement Learning

03/24/2023
by Rafael Pina, et al.

When learning a task as a team, some agents in Multi-Agent Reinforcement Learning (MARL) may fail to understand their true impact on the performance of the team. Such agents end up learning sub-optimal policies and exhibit undesired lazy behaviours. To investigate this problem, we start by formalising the use of temporal causality applied to MARL problems. We then show how causality can be used to penalise such lazy agents and improve their behaviours. By understanding how its local observations are causally related to the team reward, each agent in the team can adjust its individual credit based on whether it helped to cause the reward or not. We show empirically that using causality estimations in MARL improves not only the overall performance of the team, but also the individual capabilities of each agent. We observe that the improvements are consistent across a set of different environments.
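
The abstract does not specify how causality is estimated, so the following is only a minimal sketch of the general idea, assuming a Granger-style test: an agent's observations are treated as causal for the team reward if adding their recent history improves a linear prediction of that reward. The function names, the `lag`, `threshold` and `penalty` parameters, and the synthetic data are all hypothetical and are not taken from the paper.

```python
import numpy as np

def lagged_design(series, lag):
    """Stack the previous `lag` values of a 1-D series as regression features."""
    n = len(series)
    return np.column_stack([series[lag - k - 1 : n - k - 1] for k in range(lag)])

def residual_variance(features, target):
    """Variance of the least-squares residual of target on features plus intercept."""
    X = np.column_stack([np.ones(len(target)), features])
    coef, *_ = np.linalg.lstsq(X, target, rcond=None)
    return np.var(target - X @ coef)

def causal_gain(obs, reward, lag=2):
    """Relative drop in reward-prediction error once the agent's
    observation history is added to the reward's own history."""
    target = reward[lag:]
    own = lagged_design(reward, lag)
    both = np.column_stack([own, lagged_design(obs, lag)])
    return 1.0 - residual_variance(both, target) / residual_variance(own, target)

def adjusted_rewards(team_reward, observations, threshold=0.1, penalty=0.0):
    """Assumed credit rule: an agent keeps the team reward only if its
    observations look causal; otherwise it receives `penalty` instead."""
    credit = {}
    for name, obs in observations.items():
        is_causal = causal_gain(obs, team_reward) > threshold
        credit[name] = team_reward if is_causal else np.full_like(team_reward, penalty)
    return credit

# Synthetic illustration: the reward follows the "active" agent's previous
# observation, while the "lazy" agent's observations are unrelated noise.
rng = np.random.default_rng(0)
active = rng.normal(size=500)
lazy = rng.normal(size=500)
reward = np.zeros(500)
reward[1:] = active[:-1] + 0.1 * rng.normal(size=499)

credit = adjusted_rewards(reward, {"active": active, "lazy": lazy})
```

In this toy setting the active agent keeps the team reward and the lazy agent's credit is replaced by the penalty value. A linear test on scalar observations is a deliberate simplification; any causality estimator could be substituted inside `causal_gain`.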


research
06/20/2023

Discovering Causality for Efficient Cooperation in Multi-Agent Environments

In cooperative Multi-Agent Reinforcement Learning (MARL) agents are requ...
research
05/20/2020

Causality, Responsibility and Blame in Team Plans

Many objectives can be achieved (or may be achieved more effectively) on...
research
01/05/2022

Offsetting Unequal Competition through RL-assisted Incentive Schemes

This paper investigates the dynamics of competition among organizations ...
research
07/05/2022

Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning

Successful deployment of multi-agent reinforcement learning often requir...
research
07/19/2022

Few-Shot Teamwork

We propose the novel few-shot teamwork (FST) problem, where skilled agen...
research
11/12/2021

Causal Multi-Agent Reinforcement Learning: Review and Open Problems

This paper serves to introduce the reader to the field of multi-agent re...
research
11/10/2020

Emergent Reciprocity and Team Formation from Randomized Uncertain Social Preferences

Multi-agent reinforcement learning (MARL) has shown recent success in in...
