Solving Factored MDPs with Hybrid State and Action Variables

09/30/2011
by   C. Guestrin, et al.

Efficient representations and solutions for large decision problems with continuous and discrete variables are among the most important challenges faced by the designers of automated decision support systems. In this paper, we describe a novel hybrid factored Markov decision process (MDP) model that allows for a compact representation of these problems, and a new hybrid approximate linear programming (HALP) framework that permits their efficient solution. The central idea of HALP is to approximate the optimal value function by a linear combination of basis functions and to optimize its weights by linear programming. We analyze both theoretical and computational aspects of this approach, and demonstrate its scale-up potential on several hybrid optimization problems.
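
The central idea stated in the abstract can be written out as a generic approximate linear program. The sketch below is one common way to pose it and is not taken from the abstract itself: the symbols $f_i$ (basis functions), $w_i$ (weights), $\psi$ (state relevance weights), $R$ (reward), $P$ (transition model), and $\gamma$ (discount factor) are assumed notation, and the paper's exact formulation may differ.

```latex
% Linear value function approximation (assumed notation)
V^{\mathbf{w}}(\mathbf{x}) = \sum_{i} w_i f_i(\mathbf{x})

% Approximate linear program over the weights w
\begin{aligned}
\min_{\mathbf{w}} \quad & \sum_{i} w_i \alpha_i \\
\text{s.t.} \quad & \sum_{i} w_i F_i(\mathbf{x}, \mathbf{a}) - R(\mathbf{x}, \mathbf{a}) \ge 0
  \qquad \forall\, \mathbf{x}, \mathbf{a}
\end{aligned}

% where
\alpha_i = \mathbb{E}_{\psi}\!\left[ f_i(\mathbf{x}) \right],
\qquad
F_i(\mathbf{x}, \mathbf{a}) = f_i(\mathbf{x})
  - \gamma \, \mathbb{E}_{P(\mathbf{x}' \mid \mathbf{x}, \mathbf{a})}\!\left[ f_i(\mathbf{x}') \right]
```

In a hybrid model, each expectation would presumably combine a sum over the discrete variables with an integral over the continuous ones. The objective and constraints are linear in $\mathbf{w}$, which is why linear programming applies, but there is one constraint per state-action pair, so a continuous state or action space yields infinitely many constraints that a practical solver has to handle approximately.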
