Self-Paced Multi-Agent Reinforcement Learning

05/20/2022
by   Wenshuai Zhao, et al.
16

Curriculum reinforcement learning (CRL) aims to speed up learning of a task by changing gradually the difficulty of the task from easy to hard through control of factors such as initial state or environment dynamics. While automating CRL is well studied in the single-agent setting, in multi-agent reinforcement learning (MARL) an open question is whether control of the number of agents with other factors in a principled manner is beneficial, prior approaches typically relying on hand-crafted heuristics. In addition, how the tasks evolve as the number of agents changes remains understudied, which is critical for scaling to more challenging tasks. We introduce self-paced MARL (SPMARL) that enables optimizing the number of agents with other environment factors in a principled way, and, show that usual assumptions such as that fewer agents make the task always easier are not generally valid. The curriculum induced by SPMARL reveals the evolution of tasks w.r.t. number of agents and experiments show that SPMARL improves the performance when the number of agents sufficiently influences task difficulty.

READ FULL TEXT

page 2

page 7

research
03/06/2023

MAESTRO: Open-Ended Environment Design for Multi-Agent Reinforcement Learning

Open-ended learning methods that automatically generate a curriculum of ...
research
02/07/2023

Towards Skilled Population Curriculum for Multi-Agent Reinforcement Learning

Recent advances in multi-agent reinforcement learning (MARL) allow agent...
research
03/23/2020

Evolutionary Population Curriculum for Scaling Multi-Agent Reinforcement Learning

In multi-agent games, the complexity of the environment can grow exponen...
research
11/08/2021

Variational Automatic Curriculum Learning for Sparse-Reward Cooperative Multi-Agent Problems

We introduce a curriculum learning algorithm, Variational Automatic Curr...
research
02/11/2022

Cooperative Solutions to Exploration Tasks Under Speed and Budget Constraints

We present a multi-agent system where agents can cooperate to solve a sy...
research
07/12/2023

Maneuver Decision-Making Through Automatic Curriculum Reinforcement Learning Without Handcrafted Reward functions

Maneuver decision-making is the core of unmanned combat aerial vehicle f...
research
10/20/2020

Negotiating Team Formation Using Deep Reinforcement Learning

When autonomous agents interact in the same environment, they must often...

Please sign up or login with your details

Forgot password? Click here to reset