Information-Bottleneck-Based Behavior Representation Learning for Multi-agent Reinforcement Learning

09/29/2021
by Yue Jin, et al.

In multi-agent deep reinforcement learning, extracting sufficient and compact information about other agents is critical for efficient convergence and scalability. In canonical frameworks, such information is often distilled implicitly and uninterpretably, or explicitly with cost functions that cannot reflect the relationship between information compression and utility in the representation. In this paper, we present Information-Bottleneck-based Other agents' behavior Representation learning for Multi-agent reinforcement learning (IBORM), which explicitly seeks a low-dimensional mapping encoder that establishes a compact and informative representation relevant to other agents' behaviors. IBORM leverages the information bottleneck principle to compress observation information while retaining sufficient information about other agents' behaviors for cooperative decision-making. Empirical results demonstrate that IBORM delivers the fastest convergence and the best-performing learned policies compared with implicit behavior representation learning and with explicit behavior representation learning that does not explicitly consider information compression and utility.
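
The abstract does not state IBORM's objective, so the following is only a minimal sketch of how an information-bottleneck trade-off is commonly approximated, not the authors' formulation. It assumes a variational surrogate: a diagonal-Gaussian encoder whose KL divergence to a standard normal prior stands in for the compression term, and a cross-entropy loss on predicting another agent's discrete action stands in for the relevance term. The function names, the weight `beta`, and the choice of a discrete-action prediction target are all hypothetical.

```python
import math


def gaussian_kl_to_standard_normal(mu, log_var):
    """KL(N(mu, diag(exp(log_var))) || N(0, I)), summed over latent dimensions."""
    return 0.5 * sum(math.exp(lv) + m * m - 1.0 - lv for m, lv in zip(mu, log_var))


def cross_entropy(logits, target_index):
    """Negative log-probability of target_index under softmax(logits)."""
    max_logit = max(logits)  # subtract the max for numerical stability
    log_norm = max_logit + math.log(sum(math.exp(l - max_logit) for l in logits))
    return log_norm - logits[target_index]


def ib_loss(mu, log_var, action_logits, other_action, beta=0.01):
    """Hypothetical bottleneck loss: prediction error plus beta-weighted compression.

    mu, log_var    -- encoder outputs for one observation (assumed Gaussian encoder)
    action_logits  -- decoder logits over the other agent's actions (assumed discrete)
    other_action   -- index of the action the other agent actually took
    beta           -- assumed trade-off weight; larger values compress more
    """
    compression = gaussian_kl_to_standard_normal(mu, log_var)
    prediction = cross_entropy(action_logits, other_action)
    return prediction + beta * compression
```

Under these assumptions, raising `beta` pushes the encoder toward the prior and discards more of the observation, while lowering it favors predicting the other agent's behavior accurately.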
