We present an approach for the quantification of the usefulness of trans...
We consider a setting in which the objective is to learn to navigate in ...
We consider undiscounted reinforcement learning in Markov decision proce...
We give a simple optimistic algorithm for which it is easy to derive reg...
We consider reinforcement learning in changing Markov Decision Processes...
We introduce SCAL, an algorithm designed to perform efficient exploratio...
We consider the restless Markov bandit problem, in which the state of ea...