research
∙
07/02/2022
q-Learning in Continuous Time
We study the continuous-time counterpart of Q-learning for reinforcement...
research
∙
11/22/2021
Policy Gradient and Actor-Critic Learning in Continuous Time and Space: Theory and Algorithms
We study policy gradient (PG) for reinforcement learning in continuous t...
research
∙
08/15/2021