MA2CL: Masked Attentive Contrastive Learning for Multi-Agent Reinforcement Learning

06/03/2023
by   Haolin Song, et al.

Recent approaches have used self-supervised auxiliary tasks for representation learning to improve the performance and sample efficiency of vision-based reinforcement learning algorithms in single-agent settings. In multi-agent reinforcement learning (MARL), however, these techniques face challenges because each agent receives only a partial observation of an environment influenced by the other agents, which makes observations correlated along the agent dimension. Representation learning for MARL therefore needs to take agent-level information into account. In this paper, we propose an effective framework called Multi-Agent Masked Attentive Contrastive Learning (MA2CL), which encourages the learned representations to be predictive at both the temporal and the agent level by reconstructing masked agent observations in latent space. Specifically, we use an attention-based reconstruction model to recover the masked observations, and we train this model via contrastive learning. MA2CL allows better use of contextual information at the agent level, which facilitates training MARL agents on cooperative tasks. Extensive experiments demonstrate that our method significantly improves the performance and sample efficiency of different MARL algorithms and outperforms other methods in various vision-based and state-based scenarios. Our code is available at <https://github.com/ustchlsong/MA2CL>.
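The core idea in the abstract (mask one agent's latent, reconstruct it from the other agents with attention, and score the reconstruction with a contrastive loss) can be illustrated with a minimal sketch. This is not the authors' implementation; see the linked repository for that. Everything here is an assumption made for illustration: the function names, the zero vector used as a mask token, the parameter-free attention with identity projections, the InfoNCE-style loss that treats the other agents' latents as negatives, and the temperature value. The temporal part of the method is omitted.

```python
import math
import random


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def softmax(xs):
    m = max(xs)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def mask_agents(latents, masked_ids):
    """Replace the latents of the chosen agents with a zero mask token (assumption)."""
    dim = len(latents[0])
    return [[0.0] * dim if i in masked_ids else list(z)
            for i, z in enumerate(latents)]


def attentive_reconstruct(latents):
    """Self-attention over the agent dimension with identity projections.

    A trained model would use learned query/key/value projections; they are
    left out here to keep the sketch dependency-free.
    """
    dim = len(latents[0])
    out = []
    for q in latents:
        weights = softmax([dot(q, k) / math.sqrt(dim) for k in latents])
        out.append([sum(w * v[j] for w, v in zip(weights, latents))
                    for j in range(dim)])
    return out


def info_nce(queries, keys, masked_ids, temperature=0.1):
    """InfoNCE-style loss: the reconstruction of masked agent i should match
    target latent i, with the other agents' target latents as negatives.
    Expects a non-empty masked_ids."""
    losses = []
    for i in masked_ids:
        logits = [dot(queries[i], k) / temperature for k in keys]
        m = max(logits)
        log_z = m + math.log(sum(math.exp(l - m) for l in logits))
        losses.append(log_z - logits[i])  # equals -log softmax(logits)[i]
    return sum(losses) / len(losses)


# Toy data: 4 agents, 8-dimensional latents (hypothetical values).
rng = random.Random(0)
num_agents, dim = 4, 8
targets = [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(num_agents)]
masked_ids = [1]

masked = mask_agents(targets, masked_ids)
recon = attentive_reconstruct(masked)
loss = info_nce(recon, targets, masked_ids)
print(f"contrastive loss for masked agent(s) {masked_ids}: {loss:.4f}")
```

In a trainable version, the gradient of this loss would update the encoder and the attention model so that a masked agent's latent becomes predictable from its teammates' latents, which is the agent-level context the abstract refers to.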

research
10/31/2022

Agent-Time Attention for Sparse Rewards Multi-Agent Reinforcement Learning

Sparse and delayed rewards pose a challenge to single agent reinforcemen...
research
02/06/2021

RIIT: Rethinking the Importance of Implementation Tricks in Multi-Agent Reinforcement Learning

In recent years, Multi-Agent Reinforcement Learning (MARL) has revolutio...
research
02/10/2023

Reinforcement Learning from Multiple Sensors via Joint Representations

In many scenarios, observations from more than one sensor modality are a...
research
09/26/2021

LINDA: Multi-Agent Local Information Decomposition for Awareness of Teammates

In cooperative multi-agent reinforcement learning (MARL), where agents o...
research
09/11/2023

Learning Geometric Representations of Objects via Interaction

We address the problem of learning representations from observations of ...
research
04/07/2022

Temporal Alignment for History Representation in Reinforcement Learning

Environments in Reinforcement Learning are usually only partially observ...
research
09/29/2021

Information-Bottleneck-Based Behavior Representation Learning for Multi-agent Reinforcement learning

In multi-agent deep reinforcement learning, extracting sufficient and co...
