CAMMARL: Conformal Action Modeling in Multi Agent Reinforcement Learning

06/19/2023
by Nikunj Gupta, et al.

Before acting in an environment with more than one intelligent agent, an autonomous agent may benefit from reasoning about the other agents and from having some guarantee, or measure of confidence, about the behavior of the system. In this article, we propose CAMMARL, a novel multi-agent reinforcement learning (MARL) algorithm that models the actions of other agents in different situations as confident sets, i.e., sets that contain their true actions with high probability. We then use these estimates to inform the agent's decision-making. To estimate such sets, we use conformal prediction, which yields not only an estimate of the most probable outcome but also a quantification of the operable uncertainty. For instance, we can predict a set that provably covers the true predictions with high probability (e.g., 95%). On cooperative multi-agent tasks, we show that CAMMARL elevates the capabilities of an autonomous agent in MARL by modeling conformal prediction sets over the behavior of other agents in the environment and using these estimates to enhance its policy learning. All code can be found here: https://github.com/Nikunj-Gupta/conformal-agent-modelling.
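To make the idea of a conformal action set concrete, the following is a minimal sketch of standard split conformal prediction applied to another agent's discrete actions. It is an illustration under stated assumptions, not the CAMMARL implementation: the function names `calibrate_threshold` and `action_set`, the score `1 - p(true action)`, and the toy numbers are all hypothetical, and the abstract does not say which conformal variant the authors use.

```python
import math
import numpy as np

def calibrate_threshold(cal_probs, cal_actions, alpha=0.05):
    """Return a score threshold from held-out calibration data.

    cal_probs:   (n, num_actions) predicted action probabilities (assumed
                 to come from some learned model of the other agent).
    cal_actions: (n,) actions the other agent actually took.
    alpha:       target miscoverage, e.g. 0.05 for 95% coverage.
    """
    cal_probs = np.asarray(cal_probs, dtype=float)
    cal_actions = np.asarray(cal_actions, dtype=int)
    n = len(cal_actions)
    # Nonconformity score: one minus the probability given to the true action.
    scores = 1.0 - cal_probs[np.arange(n), cal_actions]
    # Finite-sample corrected rank: the ceil((n + 1)(1 - alpha))-th smallest score.
    k = math.ceil((n + 1) * (1 - alpha))
    if k > n:
        # Too few calibration points for this alpha: fall back to the
        # largest possible score, so every action is included.
        return 1.0
    return float(np.sort(scores)[k - 1])

def action_set(probs, qhat):
    """Return the indices of all actions whose score is within the threshold."""
    probs = np.asarray(probs, dtype=float)
    return np.flatnonzero(1.0 - probs <= qhat)

# Toy calibration data (made up for illustration): 4 observations, 3 actions.
cal_probs = [[0.7, 0.2, 0.1],
             [0.6, 0.3, 0.1],
             [0.2, 0.5, 0.3],
             [0.1, 0.1, 0.8]]
cal_actions = [0, 0, 1, 2]

qhat = calibrate_threshold(cal_probs, cal_actions, alpha=0.5)
predicted_set = action_set([0.65, 0.30, 0.05], qhat)
```

Under the usual exchangeability assumption, a set built this way contains the true action with probability at least `1 - alpha`. A policy could then receive the set (for example, as a binary mask over actions) as an additional input, which is one plausible way to "inform an agent's decision-making" in the sense described above.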

