In this paper, we present a simple and cheap ordinal bucketing algorithm...
Reinforcement learning usually makes use of numerical rewards, which hav...
In many problem settings, most notably in game playing, an agent receive...
Monte Carlo tree search (MCTS) is a popular choice for solving sequentia...