Hedging Algorithms and Repeated Matrix Games

10/15/2018
by   Bruno Bouzy, et al.
0

Playing repeated matrix games (RMG) while maximizing the cumulative returns is a basic method to evaluate multi-agent learning (MAL) algorithms. Previous work has shown that UCB, M3, S or Exp3 algorithms have good behaviours on average in RMG. Besides, hedging algorithms have been shown to be effective on prediction problems. An hedging algorithm is made up with a top-level algorithm and a set of basic algorithms. To make its decision, an hedging algorithm uses its top-level algorithm to choose a basic algorithm, and the chosen algorithm makes the decision. This paper experimentally shows that well-selected hedging algorithms are better on average than all previous MAL algorithms on the task of playing RMG against various players. S is a very good top-level algorithm, and UCB and M3 are very good basic algorithms. Furthermore, two-level hedging algorithms are more effective than one-level hedging algorithms, and three levels are not better than two levels.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/24/2019

Foolproof Cooperative Learning

This paper extends the notion of equilibrium in game theory to learning ...
research
12/20/2021

Balancing Adaptability and Non-exploitability in Repeated Games

We study the problem of guaranteeing low regret in repeated games agains...
research
09/28/2022

Pareto Actor-Critic for Equilibrium Selection in Multi-Agent Reinforcement Learning

Equilibrium selection in multi-agent games refers to the problem of sele...
research
10/09/2021

TiKick: Toward Playing Multi-agent Football Full Games from Single-agent Demonstrations

Deep reinforcement learning (DRL) has achieved super-human performance o...
research
02/24/2018

Back to Basics: Benchmarking Canonical Evolution Strategies for Playing Atari

Evolution Strategies (ES) have recently been demonstrated to be a viable...
research
09/03/2020

A Predictive Strategy for the Iterated Prisoner's Dilemma

The iterated prisoner's dilemma is a game that produces many counter-int...
research
12/24/2019

Bidding in Spades

We present a Spades bidding algorithm that is superior to recreational h...

Please sign up or login with your details

Forgot password? Click here to reset