Macro action selection with deep reinforcement learning in StarCraft

12/02/2018
by Sijia Xu, et al.

StarCraft (SC) is one of the most popular and successful Real-Time Strategy (RTS) games. In recent years, SC has also come to be regarded as a testbed for AI research because of its enormous state space, hidden information, and need for multi-agent collaboration. Thanks to the annual AIIDE and CIG competitions, a growing number of bots have been proposed and continuously improved. However, a large gap remains between the top bots and professional human players. One key reason is that current bots rely mainly on predefined rules to perform macro actions. These rules are neither scalable nor efficient enough to cope with the large, partially observed macro state space in SC. In this paper, we propose a deep reinforcement learning (DRL) based framework for macro action selection. Our framework combines the reinforcement learning approach Ape-X DQN with Long Short-Term Memory (LSTM) to improve the bot's macro action selection. We evaluate our bot, named LastOrder, on the bot set of the AIIDE 2017 StarCraft AI competition. Our bot achieves an overall win rate of 83%, outperforming 26 of the 28 entrants.

