On Reinforcement Learning for the Game of 2048

12/21/2022
by   Hung Guei, et al.
0

2048 is a single-player stochastic puzzle game. This intriguing and addictive game has been popular worldwide and has attracted researchers to develop game-playing programs. Due to its simplicity and complexity, 2048 has become an interesting and challenging platform for evaluating the effectiveness of machine learning methods. This dissertation conducts comprehensive research on reinforcement learning and computer game algorithms for 2048. First, this dissertation proposes optimistic temporal difference learning, which significantly improves the quality of learning by employing optimistic initialization to encourage exploration for 2048. Furthermore, based on this approach, a state-of-the-art program for 2048 is developed, which achieves the highest performance among all learning-based programs, namely an average score of 625377 points and a rate of 72 dissertation investigates several techniques related to 2048, including the n-tuple network ensemble learning, Monte Carlo tree search, and deep reinforcement learning. These techniques are promising for further improving the performance of the current state-of-the-art program. Finally, this dissertation discusses pedagogical applications related to 2048 by proposing course designs and summarizing the teaching experience. The proposed course designs use 2048-like games as materials for beginners to learn reinforcement learning and computer game algorithms. The courses have been successfully applied to graduate-level students and received well by student feedback.

READ FULL TEXT

page 11

page 21

page 32

page 39

research
12/17/2017

Towards a Deep Reinforcement Learning Approach for Tower Line Wars

There have been numerous breakthroughs with reinforcement learning in th...
research
11/30/2020

Applied Machine Learning for Games: A Graduate School Course

The game industry is moving into an era where old-style game engines are...
research
11/22/2021

Optimistic Temporal Difference Learning for 2048

Temporal difference (TD) learning and its variants, such as multistage T...
research
10/20/2021

Playing 2048 With Reinforcement Learning

The game of 2048 is a highly addictive game. It is easy to learn the gam...
research
10/14/2018

Assessing the Potential of Classical Q-learning in General Game Playing

After the recent groundbreaking results of AlphaGo and AlphaZero, we hav...
research
09/10/2018

ViZDoom Competitions: Playing Doom from Pixels

This paper presents the first two editions of Visual Doom AI Competition...
research
11/26/2022

Evaluation Beyond Task Performance: Analyzing Concepts in AlphaZero in Hex

AlphaZero, an approach to reinforcement learning that couples neural net...

Please sign up or login with your details

Forgot password? Click here to reset