research
∙
11/12/2021
Q-Learning for MDPs with General Spaces: Convergence and Near Optimality via Quantization under Weak Continuity
Reinforcement learning algorithms often require finiteness of state and ...
research
∙
03/22/2021
Convergence of Finite Memory Q-Learning for POMDPs and Near Optimality of Learned Policies under Filter Stability
In this paper, for POMDPs, we provide the convergence of a Q learning al...
research
∙
10/15/2020