research
∙
06/03/2011
Experiments with Infinite-Horizon, Policy-Gradient Estimation
In this paper, we present algorithms that perform gradient ascent of the...
research
∙
06/03/2011
Infinite-Horizon Policy-Gradient Estimation
Gradient-based approaches to direct policy search in reinforcement learn...
research
∙
06/01/2011