On the convergence of optimistic policy iteration for stochastic shortest path problem

08/27/2018

In this paper, we prove convergence results for a special case of the optimistic policy iteration algorithm for the stochastic shortest path problem. We consider the Monte Carlo and TD(λ) methods for the policy evaluation step, under the condition that the termination state is reached almost surely.
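To make the setting concrete, here is a minimal sketch of optimistic policy iteration with Monte Carlo policy evaluation on a toy stochastic shortest path problem. It is a generic illustration, not the special case analyzed in the paper, which the abstract does not specify. The three-state model (`P`, `G`), the 1/t step size, and the choice of one simulated trajectory per state per iteration are all assumptions made for this example. Every policy in the toy model reaches the termination state almost surely, matching the condition stated above.

```python
import random

# Hypothetical toy SSP (an assumption for illustration, not from the paper).
# States 0..2 are ordinary states; state 3 is the cost-free termination state.
# P[s][a] lists (next_state, probability) pairs; G[s][a] is the stage cost.
TERMINAL = 3
P = {
    0: {0: [(1, 1.0)], 1: [(3, 0.5), (0, 0.5)]},
    1: {0: [(2, 1.0)], 1: [(3, 0.5), (1, 0.5)]},
    2: {0: [(3, 1.0)], 1: [(3, 0.5), (2, 0.5)]},
}
G = {0: {0: 1.0, 1: 4.0}, 1: {0: 1.0, 1: 0.5}, 2: {0: 1.0, 1: 2.0}}


def q_value(J, s, a):
    """Expected one-step cost of action a in state s plus cost-to-go under J."""
    return G[s][a] + sum(p * J[s2] for s2, p in P[s][a])


def greedy_policy(J):
    """Policy improvement: pick the cost-minimizing action in every state."""
    return {s: min(P[s], key=lambda a: q_value(J, s, a)) for s in P}


def sample_cost(policy, s, rng):
    """Simulate one trajectory from s under policy; return its total cost."""
    total = 0.0
    while s != TERMINAL:
        a = policy[s]
        total += G[s][a]
        next_states, probs = zip(*P[s][a])
        s = rng.choices(next_states, weights=probs)[0]
    return total


def optimistic_policy_iteration(iterations=2000, seed=0):
    """Alternate greedy improvement with a partial Monte Carlo evaluation."""
    rng = random.Random(seed)
    J = [0.0] * (TERMINAL + 1)  # J[TERMINAL] stays 0 throughout
    for t in range(1, iterations + 1):
        policy = greedy_policy(J)
        step = 1.0 / t  # assumed diminishing step size
        for s in P:
            # Move J[s] toward a single sampled cost-to-go (the "optimistic"
            # part: the policy is not evaluated to convergence).
            J[s] += step * (sample_cost(policy, s, rng) - J[s])
    return J, greedy_policy(J)


if __name__ == "__main__":
    J, policy = optimistic_policy_iteration()
    print("estimated costs:", J)
    print("greedy policy:", policy)
```

The policy is recomputed from the current estimate on every iteration, so evaluation and improvement are interleaved rather than run to completion in turn. Whether such iterates converge is the question the paper addresses; the sketch itself makes no claim about it.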
