Assessing the Potential of Classical Q-learning in General Game Playing

10/14/2018
by   Hui Wang, et al.
1

After the recent groundbreaking results of AlphaGo and AlphaZero, we have seen strong interests in deep reinforcement learning and artificial general intelligence (AGI) in game playing. However, deep learning is resource-intensive and the theory is not yet well developed. For small games, simple classical table-based Q-learning might still be the algorithm of choice. General Game Playing (GGP) provides a good testbed for reinforcement learning to research AGI. Q-learning is one of the canonical reinforcement learning methods, and has been used by (Banerjee & Stone, IJCAI 2007) in GGP. In this paper we implement Q-learning in GGP for three small-board games (Tic-Tac-Toe, Connect Four, Hex)[source code: https://github.com/wh1992v/ggp-rl], to allow comparison to Banerjee et al.. We find that Q-learning converges to a high win rate in GGP. For the ϵ-greedy strategy, we propose a first enhancement, the dynamic ϵ algorithm. In addition, inspired by (Gelly & Silver, ICML 2007) we combine online search (Monte Carlo Search) to enhance offline learning, and propose QM-learning for GGP. Both enhancements improve the performance of classical Q-learning. In this work, GGP allows us to show, if augmented by appropriate enhancements, that classical table-based Q-learning can perform well in small games.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/16/2018

Monte Carlo Q-learning for General Game Playing

Recently, the interest in reinforcement learning in game playing has bee...
research
04/28/2022

AlphaZero-Inspired General Board Game Learning and Playing

Recently, the seminal algorithms AlphaGo and AlphaZero have started a ne...
research
10/16/2018

At Human Speed: Deep Reinforcement Learning with Action Delay

There has been a recent explosion in the capabilities of game-playing ar...
research
05/13/2021

Adaptive Warm-Start MCTS in AlphaZero-like Deep Reinforcement Learning

AlphaZero has achieved impressive performance in deep reinforcement lear...
research
03/29/2023

Pgx: Hardware-accelerated parallel game simulation for reinforcement learning

We propose Pgx, a collection of board game simulators written in JAX. Th...
research
12/21/2022

On Reinforcement Learning for the Game of 2048

2048 is a single-player stochastic puzzle game. This intriguing and addi...
research
09/11/2018

SAI, a Sensible Artificial Intelligence that plays Go

We propose a multiple-komi modification of the AlphaGo Zero/Leela Zero p...

Please sign up or login with your details

Forgot password? Click here to reset