
Tackling Morpion Solitaire with AlphaZero-like Ranked Reward Reinforcement Learning
Morpion Solitaire is a popular single player game, performed with paper ...
The Second Type of Uncertainty in Monte Carlo Tree Search
Monte Carlo Tree Search (MCTS) efficiently balances exploration and expl...
Warm-Start AlphaZero Self-Play Search Enhancements
Recently, AlphaZero has achieved landmark results in deep reinforcement ...
A New Challenge: Approaching Tetris Link with AI
Decades of research have been invested in making computer programs for p...
Analysis of Hyper-Parameters for Small Games: Iterations or Epochs in Self-Play?
The landmark achievements of AlphaGo Zero have created great research in...
Fault-Tolerant Nanosatellite Computing on a Budget
Micro- and nanosatellites have become popular platforms for a variety of...
Hyper-Parameter Sweep on AlphaZero General
Since AlphaGo and AlphaGo Zero have achieved groundbreaking successes in th...
Dynamic Fault Tolerance Through Resource Pooling
Miniaturized satellites are currently not considered suitable for critic...
Assessing the Potential of Classical Q-learning in General Game Playing
After the recent groundbreaking results of AlphaGo and AlphaZero, we hav...
A0C: Alpha Zero in Continuous Action Space
A core novelty of Alpha Zero is the interleaving of tree search and deep...
Monte Carlo Tree Search for Asymmetric Trees
We present an extension of Monte Carlo Tree Search (MCTS) that strongly ...
Monte Carlo Q-learning for General Game Playing
Recently, the interest in reinforcement learning in game playing has bee...
Bringing Fault-Tolerant GigaHertz-Computing to Space: A Multi-Stage Software-Side Fault-Tolerance Approach for Miniaturized Spacecraft
Modern embedded technology is a driving factor in satellite miniaturizat...
Structured Parallel Programming for Monte Carlo Tree Search
In this paper, we present a new algorithm for parallel Monte Carlo tree ...
A Minimax Algorithm Better Than Alpha-Beta?: No and Yes
This paper has three main contributions to our understanding of fixed-de...
Ensemble UCT Needs High Exploitation
Recent results have shown that the MCTS algorithm (a new, adaptive, rand...
Best-First and Depth-First Minimax Search in Practice
Most practitioners use a variant of the Alpha-Beta algorithm, a simple d...
Data Science and Ebola
Data Science. Today, everybody and everything produces data. People pro...
Why Local Search Excels in Expression Simplification
Simplifying expressions is important to make numerical integration of la...
HEPGAME and the Simplification of Expressions
Advances in high energy physics have created the need to increase comput...
Nearly Optimal Minimax Tree Search?
Knuth and Moore presented a theoretical lower bound on the number of lea...
SSS* = Alpha-Beta + TT
In 1979 Stockman introduced the SSS* minimax search algorithm that domi...
A New Paradigm for Minimax Search
This paper introduces a new paradigm for minimax game-tree search algo ...
MTD(f), A Minimax Algorithm Faster Than NegaScout
MTD(f) is a new minimax search algorithm, simpler and more efficient tha...
Combining Simulated Annealing and Monte Carlo Tree Search for Expression Simplification
In many applications of computer algebra large expressions must be simpl...
Aske Plaat
Professor & Scientific Director of Computer Science at Leiden University