Off-Policy Action Anticipation in Multi-Agent Reinforcement Learning

04/04/2023
by   Ariyan Bighashdel, et al.
0

Learning anticipation in Multi-Agent Reinforcement Learning (MARL) is a reasoning paradigm where agents anticipate the learning steps of other agents to improve cooperation among themselves. As MARL uses gradient-based optimization, learning anticipation requires using Higher-Order Gradients (HOG), with so-called HOG methods. Existing HOG methods are based on policy parameter anticipation, i.e., agents anticipate the changes in policy parameters of other agents. Currently, however, these existing HOG methods have only been applied to differentiable games or games with small state spaces. In this work, we demonstrate that in the case of non-differentiable games with large state spaces, existing HOG methods do not perform well and are inefficient due to their inherent limitations related to policy parameter anticipation and multiple sampling stages. To overcome these problems, we propose Off-Policy Action Anticipation (OffPA2), a novel framework that approaches learning anticipation through action anticipation, i.e., agents anticipate the changes in actions of other agents, via off-policy sampling. We theoretically analyze our proposed OffPA2 and employ it to develop multiple HOG methods that are applicable to non-differentiable games with large state spaces. We conduct a large set of experiments and illustrate that our proposed HOG methods outperform the existing ones regarding efficiency and performance.

READ FULL TEXT

page 2

page 15

research
03/16/2023

Decentralized Multi-Agent Reinforcement Learning for Continuous-Space Stochastic Games

Stochastic games are a popular framework for studying multi-agent reinfo...
research
02/24/2023

Multi-Agent Reinforcement Learning with Common Policy for Antenna Tilt Optimization

This paper proposes a method for wireless network optimization applicabl...
research
06/08/2023

Negotiated Reasoning: On Provably Addressing Relative Over-Generalization

Over-generalization is a thorny issue in cognitive science, where people...
research
09/13/2017

Learning with Opponent-Learning Awareness

Multi-agent settings are quickly gathering importance in machine learnin...
research
08/13/2018

On Passivity, Reinforcement Learning and Higher-Order Learning in Multi-Agent Finite Games

In this paper, we propose a passivity-based methodology for analysis and...
research
10/24/2022

IDRL: Identifying Identities in Multi-Agent Reinforcement Learning with Ambiguous Identities

Multi-agent reinforcement learning(MARL) is a prevalent learning paradig...
research
02/23/2019

Rethinking Action Spaces for Reinforcement Learning in End-to-end Dialog Agents with Latent Variable Models

Defining action spaces for conversational agents and optimizing their de...

Please sign up or login with your details

Forgot password? Click here to reset