Approximate Linear Programming for First-order MDPs

07/04/2012
by Scott Sanner, et al.

We introduce a new approximate solution technique for first-order Markov decision processes (FOMDPs). Representing the value function as a linear combination of first-order basis functions, we compute suitable weights by casting the corresponding optimization as a first-order linear program, and we show how off-the-shelf theorem-proving and LP software can be used effectively to solve it. This technique allows one to solve FOMDPs independently of a specific domain instantiation; furthermore, it allows one to determine bounds on approximation error that apply equally to all domain instantiations. We apply this solution technique to the task of elevator scheduling with a rich feature space and multi-criteria additive reward, and demonstrate that it outperforms a number of intuitive, heuristically guided policies.
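To make the idea of fitting basis-function weights with a linear program concrete, here is a minimal sketch of approximate linear programming on a tiny ground (propositional) MDP. It is not the first-order method of the paper: it uses no theorem prover, and the three-state chain, the two basis functions, the discount factor, and every function name are assumptions chosen purely for illustration. Because there are only two weights, the LP is solved by brute-force vertex enumeration instead of an LP package.

```python
from itertools import combinations

# Hypothetical toy problem, not taken from the paper.
GAMMA = 0.9
STATES = [0, 1, 2]
ACTIONS = ["stay", "move"]


def step(s, a):
    """Deterministic transition: 'move' advances one state, capped at state 2."""
    return min(s + 1, 2) if a == "move" else s


def reward(s):
    """Reward 1 in the goal state 2, otherwise 0."""
    return 1.0 if s == 2 else 0.0


def basis(s):
    """Two assumed basis functions: a constant and the state index."""
    return (1.0, float(s))


def constraints():
    """One row per (s, a): sum_i w_i * (b_i(s) - GAMMA * b_i(s')) >= R(s)."""
    rows = []
    for s in STATES:
        for a in ACTIONS:
            b, b_next = basis(s), basis(step(s, a))
            coeffs = tuple(x - GAMMA * y for x, y in zip(b, b_next))
            rows.append((coeffs, reward(s)))
    return rows


def solve_alp():
    """Minimise sum_s V(s) with V(s) = w0*b0(s) + w1*b1(s) under the rows above.

    Enumerates intersections of constraint pairs and keeps the cheapest feasible
    one. This is only valid for a bounded two-variable LP that has a vertex.
    """
    rows = constraints()
    cost = tuple(sum(basis(s)[i] for s in STATES) for i in range(2))
    best = None
    for (a1, r1), (a2, r2) in combinations(rows, 2):
        det = a1[0] * a2[1] - a1[1] * a2[0]
        if abs(det) < 1e-12:
            continue  # parallel constraints never meet in a single vertex
        # Cramer's rule for the 2x2 system a1.w = r1, a2.w = r2
        w = ((r1 * a2[1] - a1[1] * r2) / det, (a1[0] * r2 - r1 * a2[0]) / det)
        if all(c[0] * w[0] + c[1] * w[1] >= r - 1e-9 for c, r in rows):
            value = cost[0] * w[0] + cost[1] * w[1]
            if best is None or value < best[0]:
                best = (value, w)
    return best[1]


def approx_value(w, s):
    """Linear value estimate for state s under weights w."""
    return sum(wi * bi for wi, bi in zip(w, basis(s)))
```

On this toy instance `solve_alp()` yields weights of about (8.18, 0.91), so the approximate values for states 0, 1, 2 are about 8.18, 9.09, and 10, which upper-bound the exact optimal values 8.1, 9, and 10. A ground LP like this has one constraint per state-action pair; the abstract's point is that the first-order formulation avoids committing to any particular domain instantiation.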


