α^α-Rank: Scalable Multi-agent Evaluation through Evolution

09/25/2019
by   Yaodong Yang, et al.

Evaluating strategy profiles in large networks of connected learners is challenging but crucial for enabling the next wave of machine learning applications. Recently, α-Rank, an evolutionary algorithm, was proposed as a solution for ranking joint policy profiles in multi-agent systems. α-Rank claimed scalability through an implementation that runs in time polynomial in the total number of pure strategy profiles. In this paper, we formally prove that this claim is not well grounded: α-Rank in fact exhibits complexity exponential in the number of agents, which hinders its application beyond a small number of joint profiles. To address this limitation, we propose a scalable evaluation protocol that we call α^α-Rank. Our method combines evolutionary dynamics with stochastic optimization and double oracles to achieve truly scalable ranking, with time and memory complexities linear in the number of agents. Our contributions allow us, for the first time, to conduct large-scale evaluation experiments of multi-agent systems, where we show successful results on joint strategy profile spaces of size on the order of O(2^25) (i.e., ≈33 million strategies), a setting that cannot be evaluated with current techniques.
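
The tension between "polynomial in the number of pure strategy profiles" and "exponential in the number of agents" is a matter of counting: if each of N agents picks from k pure strategies, there are k^N joint profiles. The sketch below only illustrates that arithmetic. The function name `num_joint_profiles` is hypothetical, and the choice of 25 agents with 2 strategies each is an assumption made here to reproduce the 2^25 figure, not a configuration stated in the abstract.

```python
def num_joint_profiles(num_agents: int, strategies_per_agent: int) -> int:
    """Count joint pure strategy profiles when every agent has the same
    number of pure strategies (an illustrative simplification)."""
    return strategies_per_agent ** num_agents


if __name__ == "__main__":
    # Assumed setting: 2 strategies per agent, growing number of agents.
    for n in (5, 10, 15, 20, 25):
        print(f"{n:>2} agents -> {num_joint_profiles(n, 2):,} joint profiles")
```

Any procedure whose cost is polynomial in this count is therefore exponential in the number of agents, which is the point the abstract makes about α-Rank.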

research
09/25/2019

α^α-Rank: Practically Scaling α-Rank through Stochastic Optimisation

Recently, α-Rank, a graph-based algorithm, has been proposed as a soluti...
research
03/04/2019

α-Rank: Multi-Agent Evaluation by Evolution

We introduce α-Rank, a principled evolutionary dynamics methodology, for...
research
05/17/2023

Synthesizing Resilient Strategies for Infinite-Horizon Objectives in Multi-Agent Systems

We consider the problem of synthesizing resilient and stochastically sta...
research
09/14/2020

Persistent And Scalable JADE: A Cloud based InMemory Multi-agent Framework

Multi-agent systems are often limited in terms of persistence and scalabi...
research
01/05/2022

Conditional Imitation Learning for Multi-Agent Games

While advances in multi-agent learning have enabled the training of incr...
research
09/17/2018

Learning to Collaborate: Multi-Scenario Ranking via Multi-Agent Reinforcement Learning

Ranking is a fundamental and widely studied problem in scenarios such as...
research
03/01/2023

SCRIMP: Scalable Communication for Reinforcement- and Imitation-Learning-Based Multi-Agent Pathfinding

Trading off performance guarantees in favor of scalability, the Multi-Ag...
