Opportunistic Qualitative Planning in Stochastic Systems with Incomplete Preferences over Reachability Objectives

10/04/2022
by   Abhishek N. Kulkarni, et al.
0

Preferences play a key role in determining what goals/constraints to satisfy when not all constraints can be satisfied simultaneously. In this paper, we study how to synthesize preference satisfying plans in stochastic systems, modeled as an MDP, given a (possibly incomplete) combinative preference model over temporally extended goals. We start by introducing new semantics to interpret preferences over infinite plays of the stochastic system. Then, we introduce a new notion of improvement to enable comparison between two prefixes of an infinite play. Based on this, we define two solution concepts called safe and positively improving (SPI) and safe and almost-surely improving (SASI) that enforce improvements with a positive probability and with probability one, respectively. We construct a model called an improvement MDP, in which the synthesis of SPI and SASI strategies that guarantee at least one improvement reduces to computing positive and almost-sure winning strategies in an MDP. We present an algorithm to synthesize the SPI and SASI strategies that induce multiple sequential improvements. We demonstrate the proposed approach using a robot motion planning problem.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/25/2022

Opportunistic Qualitative Planning in Stochastic Systems with Preferences over Temporal Logic Objectives

Preferences play a key role in determining what goals/constraints to sat...
research
09/25/2022

Probabilistic Planning with Partially Ordered Preferences over Temporal Goals

In this paper, we study planning in stochastic systems, modeled as Marko...
research
03/26/2021

Probabilistic Planning with Preferences over Temporal Goals

We present a formal language for specifying qualitative preferences over...
research
05/10/2021

Multi-Objective Controller Synthesis with Uncertain Human Preferences

Multi-objective controller synthesis concerns the problem of computing a...
research
11/29/2019

Refining HTN Methods via Task Insertion with Preferences

Hierarchical Task Network (HTN) planning is showing its power in real-wo...
research
05/26/2023

MDPs as Distribution Transformers: Affine Invariant Synthesis for Safety Objectives

Markov decision processes can be viewed as transformers of probability d...

Please sign up or login with your details

Forgot password? Click here to reset