Learning Embeddings for Sequential Tasks Using Population of Agents

06/05/2023
by Mridul Mahajan, et al.

We present an information-theoretic framework to learn fixed-dimensional embeddings for tasks in reinforcement learning. We leverage the idea that two tasks are similar to each other if observing an agent's performance on one task reduces our uncertainty about its performance on the other. This intuition is captured by our information-theoretic criterion which uses a diverse population of agents to measure similarity between tasks in sequential decision-making settings. In addition to qualitative assessment, we empirically demonstrate the effectiveness of our techniques based on task embeddings by quantitative comparisons against strong baselines on two application scenarios: predicting an agent's performance on a test task by observing its performance on a small quiz of tasks, and selecting tasks with desired characteristics from a given set of options.
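The similarity intuition in the abstract can be illustrated with a small sketch. It is not the paper's criterion or embedding procedure, which the abstract does not specify. It assumes each agent's performance on a task is a binary success indicator, and it uses a plain plug-in estimate of mutual information over the population. The function name `mutual_information` and the toy outcome lists are hypothetical.

```python
import math
from collections import Counter


def mutual_information(x, y):
    """Plug-in estimate of mutual information (in bits) between two
    equal-length sequences of discrete outcomes, one entry per agent."""
    if len(x) != len(y) or len(x) == 0:
        raise ValueError("x and y must be non-empty and of equal length")
    n = len(x)
    joint = Counter(zip(x, y))  # counts of (outcome on task 1, outcome on task 2)
    count_x = Counter(x)        # marginal counts for task 1
    count_y = Counter(y)        # marginal counts for task 2
    mi = 0.0
    for (a, b), c in joint.items():
        # p(a, b) / (p(a) * p(b)) simplifies to c * n / (count_x[a] * count_y[b])
        mi += (c / n) * math.log2(c * n / (count_x[a] * count_y[b]))
    return mi


# Toy population of 8 agents (assumed data): 1 = success, 0 = failure.
task_a = [1, 1, 1, 1, 0, 0, 0, 0]
task_b = [1, 1, 1, 1, 0, 0, 0, 0]  # same agents succeed as on task_a
task_c = [1, 0, 1, 0, 1, 0, 1, 0]  # success unrelated to task_a

print(mutual_information(task_a, task_b))  # high: a's outcome determines b's
print(mutual_information(task_a, task_c))  # zero: a's outcome says nothing about c
```

In this toy setting, observing an agent on `task_a` removes all uncertainty about `task_b` and none about `task_c`, so the first pair would count as similar and the second as dissimilar. A diverse population matters here: if every agent succeeded on every task, all pairwise estimates would be zero and no similarity structure could be recovered.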

research
11/02/2020

Information-theoretic Task Selection for Meta-Reinforcement Learning

In Meta-Reinforcement Learning (meta-RL) an agent is trained on a set of...
research
07/05/2022

Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning

Successful deployment of multi-agent reinforcement learning often requir...
research
11/21/2019

Information-Theoretic Confidence Bounds for Reinforcement Learning

We integrate information-theoretic concepts into the design and analysis...
research
07/27/2021

Information-Theoretic Based Target Search with Multiple Agents

This paper proposes an online path planning and motion generation algori...
research
05/17/2023

Pragmatic Reasoning in Structured Signaling Games

In this work we introduce a structured signaling game, an extension of t...
research
03/02/2023

Population-based Evaluation in Repeated Rock-Paper-Scissors as a Benchmark for Multiagent Reinforcement Learning

Progress in fields of machine learning and adversarial planning has bene...
research
01/12/2012

Sparse Reward Processes

We introduce a class of learning problems where the agent is presented w...
