Mastering Strategy Card Game (Legends of Code and Magic) via End-to-End Policy and Optimistic Smooth Fictitious Play

03/07/2023
by   Wei Xi, et al.
0

Deep Reinforcement Learning combined with Fictitious Play shows impressive results on many benchmark games, most of which are, however, single-stage. In contrast, real-world decision making problems may consist of multiple stages, where the observation spaces and the action spaces can be completely different across stages. We study a two-stage strategy card game Legends of Code and Magic and propose an end-to-end policy to address the difficulties that arise in multi-stage game. We also propose an optimistic smooth fictitious play algorithm to find the Nash Equilibrium for the two-player game. Our approach wins double championships of COG2022 competition. Extensive studies verify and show the advancement of our approach.

READ FULL TEXT

page 2

page 6

research
03/09/2023

Mastering Strategy Card Game (Hearthstone) with Improved Techniques

Strategy card game is a well-known genre that is demanding on the intell...
research
03/22/2019

Deep Fictitious Play for Stochastic Differential Games

In this paper, we apply the idea of fictitious play to design deep neura...
research
10/29/2020

Performance Indicators Contributing To Success At The Group And Play-Off Stages Of The 2019 Rugby World Cup

Performance indicators that contributed to success at the group stage an...
research
11/30/2019

Smooth Fictitious Play in N× 2 Potential Games

The paper shows that smooth fictitious play converges to a neighborhood ...
research
10/04/2019

Deep Q-Network for Angry Birds

Angry Birds is a popular video game in which the player is provided with...
research
08/17/2020

Multiagent trajectory models via game theory and implicit layer-based learning

For prediction of interacting agents' trajectories, we propose an end-to...
research
03/09/2021

Learning to Play Soccer From Scratch: Sample-Efficient Emergent Coordination through Curriculum-Learning and Competition

This work proposes a scheme that allows learning complex multi-agent beh...

Please sign up or login with your details

Forgot password? Click here to reset