Planning in POMDPs Using Multiplicity Automata

07/04/2012
by   Eyal Even-Dar, et al.
0

Planning and learning in Partially Observable MDPs (POMDPs) are among the most challenging tasks in both the AI and Operation Research communities. Although solutions to these problems are intractable in general, there might be special cases, such as structured POMDPs, which can be solved efficiently. A natural and possibly efficient way to represent a POMDP is through the predictive state representation (PSR) - a representation which recently has been receiving increasing attention. In this work, we relate POMDPs to multiplicity automata- showing that POMDPs can be represented by multiplicity automata with no increase in the representation size. Furthermore, we show that the size of the multiplicity automaton is equal to the rank of the predictive state representation. Therefore, we relate both the predictive state representation and POMDPs to the well-founded multiplicity automata literature. Based on the multiplicity automata representation, we provide a planning algorithm which is exponential only in the multiplicity automata rank rather than the number of states of the POMDP. As a result, whenever the predictive state representation is logarithmic in the standard POMDP representation, our planning algorithm is efficient.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/20/2020

Quantum Tensor Networks, Stochastic Processes, and Weighted Automata

Modeling joint probability distributions over sequences has been studied...
research
01/02/2020

Representing Unordered Data Using Multiset Automata and Complex Numbers

Unordered, variable-sized inputs arise in many settings across multiple ...
research
07/05/2019

Nonuniform Families of Polynomial-Size Quantum Finite Automata and Quantum Logarithmic-Space Computation with Polynomial-Size Advice

The state complexity of a finite(-state) automaton intuitively measures ...
research
10/19/2021

Sky Is Not the Limit: Tighter Rank Bounds for Elevator Automata in Büchi Automata Complementation (Technical Report)

We propose several heuristics for mitigating one of the main causes of c...
research
10/15/2020

Reducing (to) the Ranks: Efficient Rank-based Büchi Automata Complementation (Technical Report)

This paper provides several optimizations of the rank-based approach for...
research
05/06/2022

Alternating Good-for-MDP Automata

When omega-regular objectives were first proposed in model-free reinforc...
research
05/17/2019

Simulations in Rank-Based Büchi Automata Complementation

The long search for an optimal complementation construction for Büchi au...

Please sign up or login with your details

Forgot password? Click here to reset