Probabilistic Recursive Reasoning for Multi-Agent Reinforcement Learning

01/26/2019
by   Ying Wen, et al.
12

Humans are capable of attributing latent mental contents such as beliefs, or intentions to others. The social skill is critical in everyday life to reason about the potential consequences of their behaviors so as to plan ahead. It is known that humans use this reasoning ability recursively, i.e. considering what others believe about their own beliefs. In this paper, we start from level-1 recursion and introduce a probabilistic recursive reasoning (PR2) framework for multi-agent reinforcement learning. Our hypothesis is that it is beneficial for each agent to account for how the opponents would react to its future behaviors. Under the PR2 framework, we adopt variational Bayes methods to approximate the opponents' conditional policy, to which each agent finds the best response and then improve their own policy. We develop decentralized-training-decentralized-execution algorithms, PR2-Q and PR2-Actor-Critic, that are proved to converge in the self-play scenario when there is one Nash equilibrium. Our methods are tested on both the matrix game and the differential game, which have a non-trivial equilibrium where common gradient-based methods fail to converge. Our experiments show that it is critical to reason about how the opponents believe about what the agent believes. We expect our work to contribute a new idea of modeling the opponents to the multi-agent reinforcement learning community.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/26/2019

Multi-Agent Generalized Recursive Reasoning

We propose a new reasoning protocol called generalized recursive reasoni...
research
03/06/2022

Recursive Reasoning Graph for Multi-Agent Reinforcement Learning

Multi-agent reinforcement learning (MARL) provides an efficient way for ...
research
09/08/2019

Bi-level Actor-Critic for Multi-agent Coordination

Coordination is one of the essential problems in multi-agent systems. Ty...
research
04/09/2020

Re-conceptualising the Language Game Paradigm in the Framework of Multi-Agent Reinforcement Learning

In this paper, we formulate the challenge of re-conceptualising the lang...
research
08/04/2021

Model-Based Opponent Modeling

When one agent interacts with a multi-agent environment, it is challengi...
research
09/02/2021

Multi-Agent Inverse Reinforcement Learning: Suboptimal Demonstrations and Alternative Solution Concepts

Multi-agent inverse reinforcement learning (MIRL) can be used to learn r...
research
04/03/2020

Reselling Information

Information is replicable in that it can be simultaneously consumed and ...

Please sign up or login with your details

Forgot password? Click here to reset