In 2019, Rhoban Football Club took first place in the KidSize soc...
Online Reinforcement Learning for Real-Time Exploration in Continuous State and Action Markov Decision Processes
This paper presents a new method to learn online policies in continuous ...
Ludovic Hofer