research ∙ 09/09/2020
Improved Exploration in Factored Average-Reward MDPs
We consider a regret minimization task under the average-reward criterio...
research ∙ 04/20/2020
Tightening Exploration in Upper Confidence Reinforcement Learning
The upper confidence reinforcement learning (UCRL2) strategy introduced ...
research ∙ 10/09/2019
Model-Based Reinforcement Learning Exploiting State-Action Equivalence
Leveraging an equivalence property in the state-space of a Markov Decisi...
research ∙ 03/05/2018