Population-based Evaluation in Repeated Rock-Paper-Scissors as a Benchmark for Multiagent Reinforcement Learning

03/02/2023
by   Marc Lanctot, et al.
0

Progress in fields of machine learning and adversarial planning has benefited significantly from benchmark domains, from checkers and the classic UCI data sets to Go and Diplomacy. In sequential decision-making, agent evaluation has largely been restricted to few interactions against experts, with the aim to reach some desired level of performance (e.g. beating a human professional player). We propose a benchmark for multiagent learning based on repeated play of the simple game Rock, Paper, Scissors along with a population of forty-three tournament entries, some of which are intentionally sub-optimal. We describe metrics to measure the quality of agents based both on average returns and exploitability. We then show that several RL, online learning, and language model approaches can learn good counter-strategies and generalize well, but ultimately lose to the top-performing bots, creating an opportunity for research in multiagent learning.

READ FULL TEXT

page 3

page 4

page 13

page 14

research
10/11/2022

Mastering the Game of No-Press Diplomacy via Human-Regularized Reinforcement Learning and Planning

No-press Diplomacy is a complex strategy game involving both cooperation...
research
09/18/2023

Mechanic Maker 2.0: Reinforcement Learning for Evaluating Generated Rules

Automated game design (AGD), the study of automatically generating game ...
research
06/26/2020

A Framework for Reinforcement Learning and Planning

Sequential decision making, commonly formalized as Markov Decision Proce...
research
03/15/2012

Automated Planning in Repeated Adversarial Games

Game theory's prescriptive power typically relies on full rationality an...
research
06/07/2023

Professional Basketball Player Behavior Synthesis via Planning with Diffusion

Dynamically planning in multi-agent systems has been explored to improve...
research
12/17/2018

Malthusian Reinforcement Learning

Here we explore a new algorithmic framework for multi-agent reinforcemen...
research
06/05/2023

Learning Embeddings for Sequential Tasks Using Population of Agents

We present an information-theoretic framework to learn fixed-dimensional...

Please sign up or login with your details

Forgot password? Click here to reset