Learning to Learn Group Alignment: A Self-Tuning Credo Framework with Multiagent Teams

04/14/2023
by David Radke, et al.

Mixed incentives among a population with multiagent teams have been shown to offer advantages over a fully cooperative system; however, discovering the best mixture of incentives or team structure is a difficult and dynamic problem. We propose a framework in which individual learning agents self-regulate their configuration of incentives through various parts of their reward function. This work extends previous work by giving agents the ability to dynamically update their group alignment during learning and by allowing teammates to have different group alignments. Our model builds on ideas from hierarchical reinforcement learning and meta-learning to learn the configuration of a reward function that supports the development of a behavioral policy. We provide preliminary results in a commonly studied multiagent environment and find that agents can achieve better global outcomes by self-tuning their respective group alignment parameters.
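As a rough illustration of what "configuration of incentives through various parts of their reward function" could look like, the sketch below mixes an agent's own reward with team and system averages. The abstract does not give the formulation, so the three-way split into self, team, and system weights, the use of means, and the names `Credo` and `mixed_reward` are all assumptions made for this example, not the authors' implementation. The self-tuning step (a higher-level learner choosing the weights) is not shown.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Credo:
    """Hypothetical group-alignment weights; assumed non-negative and summing to 1."""
    self_w: float
    team_w: float
    system_w: float

    def __post_init__(self):
        weights = (self.self_w, self.team_w, self.system_w)
        if min(weights) < 0 or abs(sum(weights) - 1.0) > 1e-9:
            raise ValueError("credo weights must be non-negative and sum to 1")


def mixed_reward(credo, own_reward, team_rewards, all_rewards):
    """Blend one agent's reward with its team mean and the population mean.

    Assumption: team and system components are simple averages of the
    per-agent environment rewards for the current step.
    """
    team_mean = sum(team_rewards) / len(team_rewards)
    system_mean = sum(all_rewards) / len(all_rewards)
    return (
        credo.self_w * own_reward
        + credo.team_w * team_mean
        + credo.system_w * system_mean
    )
```

Under these assumptions, teammates with different group alignments would simply hold different `Credo` instances, and an agent that updates its alignment during learning would swap its `Credo` between steps or episodes.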

research
11/19/2018

Scalable agent alignment via reward modeling: a research direction

One obstacle to applying reinforcement learning algorithms to real-world...
research
11/19/2020

Inverse Constrained Reinforcement Learning

Standard reinforcement learning (RL) algorithms train agents to maximize...
research
03/08/2021

Self-Supervised Online Reward Shaping in Sparse-Reward Environments

We propose a novel reinforcement learning framework that performs self-s...
research
06/28/2023

Towards a Better Understanding of Learning with Multiagent Teams

While it has long been recognized that a team of individual learning age...
research
03/04/2019

NoRML: No-Reward Meta Learning

Efficiently adapting to new environments and changes in dynamics is crit...
research
03/07/2017

Vocabulary Alignment in Openly Specified Interactions

The problem of achieving common understanding between agents that use di...
research
06/01/2020

A novel approach for multi-agent cooperative pursuit to capture grouped evaders

An approach of mobile multi-agent pursuit based on application of self-o...
