Universal Policies to Learn Them All

08/24/2019
by   Hassam Ullah Sheikh, et al.
0

We explore a collaborative and cooperative multi-agent reinforcement learning setting where a team of reinforcement learning agents attempt to solve a single cooperative task in a multi-scenario setting. We propose a novel multi-agent reinforcement learning algorithm inspired by universal value function approximators that not only generalizes over state space but also over a set of different scenarios. Additionally, to prove our claim, we are introducing a challenging 2D multi-agent urban security environment where the learning agents are trying to protect a person from nearby bystanders in a variety of scenarios. Our study shows that state-of-the-art multi-agent reinforcement learning algorithms fail to generalize a single task over multiple scenarios while our proposed solution works equally well as scenario-dependent policies.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/18/2020

Learning Complex Multi-Agent Policies in Presence of an Adversary

In recent years, there has been some outstanding work on applying deep r...
research
09/19/2023

Multicopy Reinforcement Learning Agents

This paper examines a novel type of multi-agent problem, in which an age...
research
03/28/2023

The challenge of redundancy on multi-agent value factorisation

In the field of cooperative multi-agent reinforcement learning (MARL), t...
research
06/07/2020

AI-QMIX: Attention and Imagination for Dynamic Multi-Agent Reinforcement Learning

Real world multi-agent tasks often involve varying types and quantities ...
research
08/28/2023

Policy Diversity for Cooperative Agents

Standard cooperative multi-agent reinforcement learning (MARL) methods a...
research
07/14/2021

Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot

Existing evaluation suites for multi-agent reinforcement learning (MARL)...
research
06/01/2023

EMOTE: An Explainable architecture for Modelling the Other Through Empathy

We can usually assume others have goals analogous to our own. This assum...

Please sign up or login with your details

Forgot password? Click here to reset