Point-Based POMDP Algorithms: Improved Analysis and Implementation

07/04/2012
by   Trey Smith, et al.
0

Existing complexity bounds for point-based POMDP value iteration algorithms focus either on the curse of dimensionality or the curse of history. We derive a new bound that relies on both and uses the concept of discounted reachability; our conclusions may help guide future algorithm design. We also discuss recent improvements to our (point-based) heuristic search value iteration algorithm. Our new implementation calculates tighter initial bounds, avoids solving linear programs, and makes more effective use of sparsity.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/18/2019

Stopping Criteria for Value and Strategy Iteration on Concurrent Stochastic Reachability Games

We consider concurrent stochastic games played on graphs with reachabili...
research
07/11/2012

Heuristic Search Value Iteration for POMDPs

We present a novel POMDP planning algorithm called heuristic search valu...
research
09/23/2020

Comparison of Algorithms for Simple Stochastic Games

Simple stochastic games are turn-based 2.5-player zero-sum graph games w...
research
08/21/2020

Comparison of Algorithms for Simple Stochastic Games (Full Version)

Simple stochastic games are turn-based 2.5-player zero-sum graph games w...
research
10/02/2019

Optimistic Value Iteration

Markov decision processes are widely used for planning and verification ...
research
01/30/2013

Solving POMDPs by Searching in Policy Space

Most algorithms for solving POMDPs iteratively improve a value function ...
research
04/30/2012

OCT Segmentation Survey and Summary Reviews and a Novel 3D Segmentation Algorithm and a Proof of Concept Implementation

We overview the existing OCT work, especially the practical aspects of i...

Please sign up or login with your details

Forgot password? Click here to reset