Monte Carlo Q-learning for General Game Playing

02/16/2018
by   Hui Wang, et al.
0

Recently, the interest in reinforcement learning in game playing has been renewed. This is evidenced by the groundbreaking results achieved by AlphaGo. General Game Playing (GGP) provides a good testbed for reinforcement learning, currently one of the hottest fields of AI. In GGP, a specification of games rules is given. The description specifies a reinforcement learning problem, leaving programs to find strategies for playing well. Q-learning is one of the canonical reinforcement learning methods, which is used as baseline on some previous work (Banerjee & Stone, IJCAI 2007). We implement Q-learning in GGP for three small board games (Tic-Tac-Toe, Connect-Four, Hex). We find that Q-learning converges, and thus that this general reinforcement learning method is indeed applicable to General Game Playing. However, convergence is slow, in comparison to MCTS (a reinforcement learning method reported to achieve good results). We enhance Q-learning with Monte Carlo Search. This enhancement improves performance of pure Q-learning, although it does not yet out-perform MCTS. Future work is needed into the relation between MCTS and Q-learning, and on larger problem instances.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/14/2018

Assessing the Potential of Classical Q-learning in General Game Playing

After the recent groundbreaking results of AlphaGo and AlphaZero, we hav...
research
08/23/2022

Solving Royal Game of Ur Using Reinforcement Learning

Reinforcement Learning has recently surfaced as a very powerful tool to ...
research
10/20/2021

Playing 2048 With Reinforcement Learning

The game of 2048 is a highly addictive game. It is easy to learn the gam...
research
04/01/2020

A New Challenge: Approaching Tetris Link with AI

Decades of research have been invested in making computer programs for p...
research
04/07/2021

On the Power of Refined Skat Selection

Skat is a fascinating combinatorial card game, show-casing many of the i...
research
12/10/2015

Convolutional Monte Carlo Rollouts in Go

In this work, we present a MCTS-based Go-playing program which uses conv...
research
03/23/2020

Neural Game Engine: Accurate learning ofgeneralizable forward models from pixels

Access to a fast and easily copied forward model of a game is essential ...

Please sign up or login with your details

Forgot password? Click here to reset