-
Episodic Reinforcement Learning in Finite MDPs: Minimax Lower Bounds Revisited
In this paper, we propose new problem-independent lower bounds on the sa...
read it
-
Fast active learning for pure exploration in reinforcement learning
Realistic environments often provide agents with very limited feedback. ...
read it
-
A Kernel-Based Approach to Non-Stationary Reinforcement Learning in Metric Spaces
In this work, we propose KeRNS: an algorithm for episodic reinforcement ...
read it
-
Adaptive Reward-Free Exploration
Reward-free exploration is a reinforcement learning setting recently stu...
read it
-
Planning in Markov Decision Processes with Gap-Dependent Sample Complexity
We propose MDP-GapE, a new trajectory-based Monte-Carlo Tree Search algo...
read it
-
Regret Bounds for Kernel-Based Reinforcement Learning
We consider the exploration-exploitation dilemma in finite-horizon reinf...
read it

Omar Darwiche Domingues
is this you? claim profile