Learning to Reason in Round-based Games: Multi-task Sequence Generation for Purchasing Decision Making in First-person Shooters

08/12/2020
by   Yilei Zeng, et al.

Sequential reasoning is a complex human ability. Extensive previous research has focused on gaming AI within a single continuous game, but round-based decision making that extends over a sequence of games remains less explored. Counter-Strike: Global Offensive (CS:GO), a round-based game with abundant expert demonstrations, provides an excellent environment for multi-player round-based sequential reasoning. In this work, we propose a Sequence Reasoner with a Round Attribute Encoder and a Multi-Task Decoder to interpret the strategies behind round-based purchasing decisions. We adopt few-shot learning to sample multiple rounds in a match, and we use a modified version of the model-agnostic meta-learning algorithm Reptile for the meta-learning loop. We formulate each round as a multi-task sequence generation problem. Our state representations combine an action encoder, a team encoder, player features, a round attribute encoder, and economy encoders to help our agent learn to reason in this specific multi-player round-based scenario. A complete ablation study and a comparison with the greedy approach confirm the effectiveness of our model. Our research will open doors for interpretable AI for understanding episodic and long-term purchasing strategies beyond the gaming community.
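For readers unfamiliar with Reptile, the following is a minimal sketch of the standard (unmodified) Reptile update on a toy one-dimensional regression problem. It is not the authors' modified variant, which the abstract does not specify. The function names `inner_adapt` and `reptile_step`, the linear model, and all hyperparameter values are assumptions made for illustration; each "task" here is a small list of (x, y) pairs standing in for the rounds sampled from one match.

```python
def inner_adapt(theta, task, k, lr):
    """Run k full-batch gradient steps on one task (hypothetical toy model).

    theta is a (w, b) pair for the linear model y = w * x + b,
    task is a list of (x, y) pairs, and the loss is 0.5 * squared error.
    """
    w, b = theta
    n = len(task)
    for _ in range(k):
        # Compute both gradients before updating either parameter.
        grad_w = sum((w * x + b - y) * x for x, y in task) / n
        grad_b = sum((w * x + b - y) for x, y in task) / n
        w -= lr * grad_w
        b -= lr * grad_b
    return (w, b)


def reptile_step(theta, task, k=5, inner_lr=0.05, meta_lr=0.1):
    """One Reptile meta-update: move theta toward the task-adapted weights."""
    phi = inner_adapt(theta, task, k, inner_lr)
    return tuple(t + meta_lr * (p - t) for t, p in zip(theta, phi))


if __name__ == "__main__":
    # Two toy tasks with different underlying slopes (assumed data).
    tasks = [
        [(1.0, 2.0), (2.0, 4.0)],
        [(1.0, 3.0), (2.0, 6.0)],
    ]
    theta = (0.0, 0.0)
    for step in range(100):
        theta = reptile_step(theta, tasks[step % len(tasks)])
    print("meta-learned initialisation:", theta)
```

The design point is that the meta-update needs no second-order gradients: it only interpolates between the current initialisation and the weights obtained after a few ordinary gradient steps on a sampled task.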

research
08/04/2022

Meta-learning from Learning Curves Challenge: Lessons learned from the First Round and Design of the Second Round

Meta-learning from learning curves is an important yet often neglected r...
research
08/13/2019

Meta Reasoning over Knowledge Graphs

The ability to reason over learned knowledge is an innate ability for hu...
research
09/20/2021

Optimal Team Economic Decisions in Counter-Strike

The outputs of win probability models are often used to evaluate player ...
research
12/18/2020

Which Heroes to Pick? Learning to Draft in MOBA Games with Neural Networks and Tree Search

Hero drafting is essential in MOBA game playing as it builds the team of...
research
05/22/2018

Multi-task Maximum Entropy Inverse Reinforcement Learning

Multi-task Inverse Reinforcement Learning (IRL) is the problem of inferr...
research
12/01/2021

Meta Arcade: A Configurable Environment Suite for Meta-Learning

Most approaches to deep reinforcement learning (DRL) attempt to solve a ...
research
08/17/2017

General AI Challenge - Round One: Gradual Learning

The General AI Challenge is an initiative to encourage the wider artific...
