Evaluation Function Approximation for Scrabble

01/25/2019
by   Rishabh Agarwal, et al.
0

The current state-of-the-art Scrabble agents are not learning-based but depend on truncated Monte Carlo simulations and the quality of such agents is contingent upon the time available for running the simulations. This thesis takes steps towards building a learning-based Scrabble agent using self-play. Specifically, we try to find a better function approximation for the static evaluation function used in Scrabble which determines the move goodness at a given board configuration. In this work, we experimented with evolutionary algorithms and Bayesian Optimization to learn the weights for an approximate feature-based evaluation function. However, these optimization methods were not quite effective, which lead us to explore the given problem from an Imitation Learning point of view. We also tried to imitate the ranking of moves produced by the Quackle simulation agent using supervised learning with a neural network function approximator which takes the raw representation of the Scrabble board as the input instead of using only a fixed number of handcrafted features.

READ FULL TEXT
research
08/23/2022

Solving Royal Game of Ur Using Reinforcement Learning

Reinforcement Learning has recently surfaced as a very powerful tool to ...
research
08/22/2018

Approximating Poker Probabilities with Deep Learning

Many poker systems, whether created with heuristics or machine learning,...
research
09/17/2020

Evolutionary Selective Imitation: Interpretable Agents by Imitation Learning Without a Demonstrator

We propose a new method for training an agent via an evolutionary strate...
research
02/07/2022

A Ranking Game for Imitation Learning

We propose a new framework for imitation learning - treating imitation a...
research
07/18/2019

SentiMATE: Learning to play Chess through Natural Language Processing

We present SentiMATE, a novel end-to-end Deep Learning model for Chess, ...
research
10/25/2022

Sequential Decision Making on Unmatched Data using Bayesian Kernel Embeddings

The problem of sequentially maximizing the expectation of a function see...
research
03/22/2023

CH-Go: Online Go System Based on Chunk Data Storage

The training and running of an online Go system require the support of e...

Please sign up or login with your details

Forgot password? Click here to reset