Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning

07/05/2022
by Lukas Schäfer, et al.

Successful deployment of multi-agent reinforcement learning often requires agents to adapt their behaviour. In this work, we discuss the problem of teamwork adaptation, in which a team of agents needs to adapt their policies to solve novel tasks with limited fine-tuning. Motivated by the intuition that agents need to be able to identify and distinguish tasks in order to adapt their behaviour to the current task, we propose to learn multi-agent task embeddings (MATE). These task embeddings are trained using an encoder-decoder architecture optimised for reconstruction of the transition and reward functions, which uniquely identify tasks. We propose three MATE training paradigms: independent MATE, centralised MATE, and mixed MATE, which vary in the information used for the task encoding. We show that a team of agents is able to adapt to novel tasks when provided with task embeddings, and that the embeddings learned by MATE identify tasks and provide useful information which agents leverage during adaptation.
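The encoder-decoder idea described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the linear layers, the mean pooling over transitions, the squared-error loss, and all names and dimensions (`OBS_DIM`, `ACT_DIM`, `EMB_DIM`, `encode`, `decode`, `reconstruction_loss`) are assumptions chosen for illustration, and only the forward pass and loss are shown. The encoder summarises a batch of transitions into one task embedding, and the decoder is scored on how well it predicts the next observation and the reward from the current observation, the action, and that embedding.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes, chosen only for this illustration.
OBS_DIM, ACT_DIM, EMB_DIM, T = 4, 2, 3, 5


def init_linear(in_dim, out_dim):
    """Small random weights and zero bias for one linear layer."""
    return rng.normal(scale=0.1, size=(in_dim, out_dim)), np.zeros(out_dim)


# Encoder input: (obs, action, reward, next_obs) for each transition.
enc_W, enc_b = init_linear(2 * OBS_DIM + ACT_DIM + 1, EMB_DIM)
# Decoder input: (obs, action, task embedding); output: next_obs and reward.
dec_W, dec_b = init_linear(OBS_DIM + ACT_DIM + EMB_DIM, OBS_DIM + 1)


def encode(obs, act, rew, next_obs):
    """Map T transitions to one task embedding (assumed: mean pooling)."""
    x = np.concatenate([obs, act, rew[:, None], next_obs], axis=1)
    return np.tanh(x @ enc_W + enc_b).mean(axis=0)


def decode(obs, act, z):
    """Predict next observation and reward, conditioned on embedding z."""
    z_rep = np.broadcast_to(z, (obs.shape[0], z.shape[0]))
    out = np.concatenate([obs, act, z_rep], axis=1) @ dec_W + dec_b
    return out[:, :OBS_DIM], out[:, OBS_DIM]


def reconstruction_loss(obs, act, rew, next_obs):
    """Squared error on the transition and reward predictions."""
    z = encode(obs, act, rew, next_obs)
    pred_next, pred_rew = decode(obs, act, z)
    return np.mean((pred_next - next_obs) ** 2) + np.mean((pred_rew - rew) ** 2)


# Random stand-in data for T transitions from a single task.
obs = rng.normal(size=(T, OBS_DIM))
act = rng.normal(size=(T, ACT_DIM))
rew = rng.normal(size=T)
next_obs = rng.normal(size=(T, OBS_DIM))

z = encode(obs, act, rew, next_obs)
loss = reconstruction_loss(obs, act, rew, next_obs)
```

In a trained version of this sketch, the weights would be updated to minimise `loss` across many tasks, so that `z` has to carry whatever task-specific information the decoder needs, and agents' policies would then receive `z` as an additional input.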

research
07/19/2022

Few-Shot Teamwork

We propose the novel few-shot teamwork (FST) problem, where skilled agen...
research
02/09/2023

Learning Complex Teamwork Tasks using a Sub-task Curriculum

Training a team to complete a complex task via multi-agent reinforcement...
research
06/05/2023

Learning Embeddings for Sequential Tasks Using Population of Agents

We present an information-theoretic framework to learn fixed-dimensional...
research
03/24/2023

Causality Detection for Efficient Multi-Agent Reinforcement Learning

When learning a task as a team, some agents in Multi-Agent Reinforcement...
research
07/05/2022

The StarCraft Multi-Agent Challenges+ : Learning of Multi-Stage Tasks and Environmental Factors without Precise Reward Functions

In this paper, we propose a novel benchmark called the StarCraft Multi-A...
research
09/17/2019

Emergent Tool Use From Multi-Agent Autocurricula

Through multi-agent competition, the simple objective of hide-and-seek, ...
research
04/05/2019

Synthesized Policies for Transfer and Adaptation across Tasks and Environments

The ability to transfer in reinforcement learning is key towards buildin...
