MSC: A Dataset for Macro-Management in StarCraft II

10/09/2017
by   Huikai Wu, et al.
0

Macro-management is an important problem in StarCraft, which has been studied for a long time. Various datasets together with assorted methods have been proposed in the last few years. But these datasets have some defects for boosting the academic and industrial research: 1) There're neither standard preprocessing, parsing and feature extraction procedures nor predefined training, validation and test set in some datasets. 2) Some datasets are only specified for certain tasks in macro-management. 3) Some datasets are either too small or don't have enough labeled data for modern machine learning algorithms such as deep neural networks. So most previous methods are trained with various features, evaluated on different test sets from the same or different datasets, making it difficult to be compared directly. To boost the research of macro-management in StarCraft, we release a new dataset MSC based on the platform SC2LE. MSC consists of well-designed feature vectors, pre-defined high-level actions and final result of each match. We also split MSC into training, validation and test set for the convenience of evaluation and comparison. Besides the dataset, we propose a baseline model and present initial baseline results for global state evaluation and build order prediction, which are two of the key tasks in macro-management. Various downstream tasks and analyses of the dataset are also described for the sake of research on macro-management in StarCraft II. Homepage: https://github.com/wuhuikai/MSC.

READ FULL TEXT

page 2

page 6

research
04/12/2021

SuperSim: a test set for word similarity and relatedness in Swedish

Language models are notoriously difficult to evaluate. We release SuperS...
research
09/02/2020

A Practical Chinese Dependency Parser Based on A Large-scale Dataset

Dependency parsing is a longstanding natural language processing task, w...
research
07/12/2023

No Train No Gain: Revisiting Efficient Training Algorithms For Transformer-based Language Models

The computation necessary for training Transformer-based language models...
research
03/15/2023

Dataset Management Platform for Machine Learning

The quality of the data in a dataset can have a substantial impact on th...
research
08/20/2018

Reproducible evaluation of classification methods in Alzheimer's disease: framework and application to MRI and PET data

A large number of papers have introduced novel machine learning and feat...
research
05/20/2021

Manual Evaluation Matters: Reviewing Test Protocols of Distantly Supervised Relation Extraction

Distantly supervised (DS) relation extraction (RE) has attracted much at...
research
07/22/2018

Macro-Micro Adversarial Network for Human Parsing

In human parsing, the pixel-wise classification loss has drawbacks in it...

Please sign up or login with your details

Forgot password? Click here to reset