Towards Modern Card Games with Large-Scale Action Spaces Through Action Representation

06/25/2022
by   Zhiyuan Yao, et al.
0

Axie infinity is a complicated card game with a huge-scale action space. This makes it difficult to solve this challenge using generic Reinforcement Learning (RL) algorithms. We propose a hybrid RL framework to learn action representations and game strategies. To avoid evaluating every action in the large feasible action set, our method evaluates actions in a fixed-size set which is determined using action representations. We compare the performance of our method with the other two baseline methods in terms of their sample efficiency and the winning rates of the trained models. We empirically show that our method achieves an overall best winning rate and the best sample efficiency among the three methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/02/2020

Action Space Shaping in Deep Reinforcement Learning

Reinforcement learning (RL) has been successful in training agents in va...
research
04/15/2021

Generalising Discrete Action Spaces with Conditional Action Trees

There are relatively few conventions followed in reinforcement learning ...
research
10/28/2020

Learning to Represent Action Values as a Hypergraph on the Action Vertices

Action-value estimation is a critical component of many reinforcement le...
research
11/20/2019

Solving Online Threat Screening Games using Constrained Action Space Reinforcement Learning

Large-scale screening for potential threats with limited resources and c...
research
10/07/2019

Reinforcement Learning with Structured Hierarchical Grammar Representations of Actions

From a young age humans learn to use grammatical principles to hierarchi...
research
11/22/2017

Asymmetric Action Abstractions for Multi-Unit Control in Adversarial Real-Time Games

Action abstractions restrict the number of legal actions available durin...
research
10/26/2019

Comparing Observation and Action Representations for Deep Reinforcement Learning in MicroRTS

This paper presents a preliminary study comparing different observation ...

Please sign up or login with your details

Forgot password? Click here to reset