
Solving CommonPayoff Games with Approximate Policy Iteration
For artificially intelligent learning systems to have widespread applica...
read it

Hindsight and Sequential Rationality of Correlated Play
Driven by recent successes in twoplayer, zerosum game solving and play...
read it

Negotiating Team Formation Using Deep Reinforcement Learning
When autonomous agents interact in the same environment, they must often...
read it

The Advantage RegretMatching ActorCritic
Regret minimization has played a key role in online learning, equilibriu...
read it

Sound Search in Imperfect Information Games
Search has played a fundamental role in computer game research since the...
read it

Learning to Play NoPress Diplomacy with Best Response Policy Iteration
Recent advances in deep reinforcement learning (RL) have led to consider...
read it

Approximate exploitability: Learning a best response in large games
A common metric in games of imperfect information is exploitability, i.e...
read it

From Poincaré Recurrence to Convergence in Imperfect Information Games: Finding Equilibrium via Regularization
In this paper we investigate the Follow the Regularized Leader dynamics ...
read it

A Generalized Training Approach for Multiagent Learning
This paper investigates a populationbased training regime based on game...
read it

OpenSpiel: A Framework for Reinforcement Learning in Games
OpenSpiel is a collection of environments and algorithms for research in...
read it

Neural Replicator Dynamics
In multiagent learning, agents interact in inherently nonstationary envi...
read it

Computing Approximate Equilibria in Sequential Adversarial Games by Exploitability Descent
In this paper, we present exploitability descent, a new algorithm to com...
read it

αRank: MultiAgent Evaluation by Evolution
We introduce αRank, a principled evolutionary dynamics methodology, for...
read it

Autocurricula and the Emergence of Innovation from Social Interaction: A Manifesto for MultiAgent Intelligence Research
Evolution has produced a multiscale mosaic of interacting adaptive unit...
read it

The Hanabi Challenge: A New Frontier for AI Research
From the early days of computing, games have been important testbeds for...
read it

ActorCritic Policy Optimization in Partially Observable Multiagent Environments
Optimization of parameterized policies for reinforcement learning (RL) i...
read it

Variance Reduction in Monte Carlo Counterfactual Regret Minimization (VRMCCFR) for Extensive Form Games using Baselines
Learning strategies for imperfect information games from samples of inte...
read it

Emergent Communication through Negotiation
Multiagent reinforcement learning offers a way to study how communicati...
read it

A Generalised Method for Empirical Game Theoretic Analysis
This paper provides theoretical bounds for empirical game theoretical an...
read it

Mastering Chess and Shogi by SelfPlay with a General Reinforcement Learning Algorithm
The game of chess is the most widelystudied domain in the history of ar...
read it

Symmetric Decomposition of Asymmetric Games
We introduce new theoretical insights into twopopulation asymmetric gam...
read it

A Unified GameTheoretic Approach to Multiagent Reinforcement Learning
To achieve general intelligence, agents must learn how to interact with ...
read it

ValueDecomposition Networks For Cooperative MultiAgent Learning
We study the problem of cooperative multiagent reinforcement learning w...
read it

Deep Qlearning from Demonstrations
Deep reinforcement learning (RL) has achieved several high profile succe...
read it

Multiagent Reinforcement Learning in Sequential Social Dilemmas
Matrix games like Prisoner's Dilemma have guided research on social dile...
read it

MemoryEfficient Backpropagation Through Time
We propose a novel approach to reduce memory consumption of the backprop...
read it

Convolution by Evolution: Differentiable Pattern Producing Networks
In this work we introduce a differentiable version of the Compositional ...
read it

Monte Carlo Tree Search with Heuristic Evaluations using Implicit Minimax Backups
Monte Carlo Tree Search (MCTS) has improved the performance of game engi...
read it

NoRegret Learning in ExtensiveForm Games with Imperfect Recall
Counterfactual Regret Minimization (CFR) is an efficient noregret learn...
read it