Reachable Space Characterization of Markov Decision Processes with Time Variability

05/22/2019
by   Junhong Xu, et al.
We propose a solution to a time-varying variant of Markov Decision Processes that can be used to address decision-theoretic planning problems for autonomous systems operating in unstructured outdoor environments. We explore the time variability of the planning stochasticity and investigate state reachability, and on that basis we develop an efficient iterative method that offers a good trade-off between solution optimality and time complexity. The reachable space is constructed by analyzing the means and variances of the states' future reaching times. We validate our algorithm through extensive simulations using ocean data, and the results show that our method performs well in terms of both solution quality and computing time.
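
To make the idea of reaching-time means and variances concrete, the following is a minimal sketch and not the authors' algorithm. It assumes a fixed (time-homogeneous) transition matrix under a fixed policy, which is simpler than the time-varying setting of the paper, and computes the mean and variance of the number of steps needed to reach a target state using the standard first-passage recursions. The function names, the `horizon` and `k` parameters, and the thresholding rule in `likely_reachable` are hypothetical choices made for illustration only.

```python
import math


def reaching_time_stats(P, target, tol=1e-12, max_sweeps=100_000):
    """Mean and variance of the first-passage time to `target`.

    P is a row-stochastic transition matrix (list of lists) for a fixed
    policy. Assumption: the chain is time-homogeneous and `target` is
    reached with probability 1 from every state.
    """
    n = len(P)
    mean = [0.0] * n    # E[T_i], with E[T_target] = 0
    second = [0.0] * n  # E[T_i^2], with E[T_target^2] = 0
    for _ in range(max_sweeps):
        delta = 0.0
        for i in range(n):
            if i == target:
                continue
            # E[T_i] = 1 + sum_j P_ij * E[T_j]
            m = 1.0 + sum(P[i][j] * mean[j] for j in range(n))
            # E[T_i^2] = 1 + sum_j P_ij * (2 * E[T_j] + E[T_j^2])
            s = 1.0 + sum(P[i][j] * (2.0 * mean[j] + second[j]) for j in range(n))
            delta = max(delta, abs(m - mean[i]), abs(s - second[i]))
            mean[i], second[i] = m, s
        if delta < tol:
            break
    # Var[T_i] = E[T_i^2] - E[T_i]^2, clipped at 0 against round-off
    var = [max(s - m * m, 0.0) for m, s in zip(mean, second)]
    return mean, var


def likely_reachable(mean, var, horizon, k=1.0):
    """Hypothetical rule: keep states whose reaching time, k standard
    deviations below the mean, fits inside the planning horizon."""
    return [i for i in range(len(mean))
            if mean[i] - k * math.sqrt(var[i]) <= horizon]


# Toy 3-state chain: 0 -> 1 -> 2, each forward move succeeds with prob 0.5.
P = [
    [0.5, 0.5, 0.0],
    [0.0, 0.5, 0.5],
    [0.0, 0.0, 1.0],
]
mean, var = reaching_time_stats(P, target=2)
states = likely_reachable(mean, var, horizon=1.0)
```

In this toy chain the reaching time from state 1 is geometric with success probability 0.5, and from state 0 it is the sum of two such independent waits, which gives a quick way to sanity-check the recursions by hand.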

research
01/04/2019

Solving Markov Decision Processes with Reachability Characterization from Mean First Passage Times

A new mechanism for efficiently solving the Markov decision processes (M...
research
11/29/2019

Learning and Planning for Time-Varying MDPs Using Maximum Likelihood Estimation

This paper proposes a formal approach to learning and planning for agent...
research
10/22/2017

The Complexity of Graph-Based Reductions for Reachability in Markov Decision Processes

We study the never-worse relation (NWR) for Markov decision processes wi...
research
12/22/2020

Autonomous sPOMDP Environment Modeling With Partial Model Exploitation

A state space representation of an environment is a classic and yet powe...
research
01/03/2019

Reachability and Differential based Heuristics for Solving Markov Decision Processes

The solution convergence of Markov Decision Processes (MDPs) can be acce...
research
07/06/2023

Exploiting Adjoints in Property Directed Reachability Analysis

We formulate, in lattice-theoretic terms, two novel algorithms inspired ...
research
08/05/2022

Planning under periodic observations: bounds and bounding-based solutions

We study planning problems faced by robots operating in uncertain enviro...
