The Winning Solution to the IEEE CIG 2017 Game Data Mining Competition

01/16/2019
by   Anna Guitart, et al.
18

Machine learning competitions such as those organized by Kaggle or KDD represent a useful benchmark for data science research. In this work, we present our winning solution to the Game Data Mining competition hosted at the 2017 IEEE Conference on Computational Intelligence and Games (CIG 2017). The contest consisted of two tracks, and participants (more than 250, belonging to both industry and academia) were to predict which players would stop playing the game, as well as their remaining lifetime. The data were provided by a major worldwide video game company, NCSoft, and came from their successful massively multiplayer online game Blade and Soul. Here, we describe the long short-term memory approach and conditional inference survival ensemble model that made us win both tracks of the contest, as well as the validation procedure that we followed in order to prevent overfitting. In particular, choosing a survival method able to deal with censored data was crucial to accurately predict the moment in which each player would leave the game, as censoring is inherent in churn. The selected models proved to be robust against evolving conditions---since there was a change in the business model of the game (from subscription-based to free-to-play) between the two sample datasets provided---and efficient in terms of time cost. Thanks to these features and also to their a ability to scale to large datasets, our models could be readily implemented in real business settings.

READ FULL TEXT

page 2

page 4

page 5

page 6

page 8

page 11

page 12

page 13

research
02/07/2018

Game Data Mining Competition on Churn Prediction and Survival Analysis using Commercial Game Log Data

Usually, game companies avoid sharing their game data with external rese...
research
10/06/2017

Games and Big Data: A Scalable Multi-Dimensional Churn Prediction Model

The emergence of mobile games has caused a paradigm shift in the video-g...
research
09/07/2021

IEEE BigData 2021 Cup: Soft Sensing at Scale

IEEE BigData 2021 Cup: Soft Sensing at Scale is a data mining competitio...
research
06/25/2019

From Non-Paying to Premium: Predicting User Conversion in Video Games with Ensemble Learning

Retaining premium players is key to the success of free-to-play games, b...
research
07/09/2019

Profiling Players with Engagement Predictions

The possibility of using player engagement predictions to profile high s...
research
05/15/2019

Survival of the Fittest in PlayerUnknown BattleGround

The goal of this paper was to predict the placement in the multiplayer g...
research
04/24/2021

Highly Efficient Memory Failure Prediction using Mcelog-based Data Mining and Machine Learning

In the data center, unexpected downtime caused by memory failures can le...

Please sign up or login with your details

Forgot password? Click here to reset