Multiple Plans are Better than One: Diverse Stochastic Planning

12/31/2020
by   Mahsa Ghasemi, et al.
23

In planning problems, it is often challenging to fully model the desired specifications. In particular, in human-robot interaction, such difficulty may arise due to human's preferences that are either private or complex to model. Consequently, the resulting objective function can only partially capture the specifications and optimizing that may lead to poor performance with respect to the true specifications. Motivated by this challenge, we formulate a problem, called diverse stochastic planning, that aims to generate a set of representative – small and diverse – behaviors that are near-optimal with respect to the known objective. In particular, the problem aims to compute a set of diverse and near-optimal policies for systems modeled by a Markov decision process. We cast the problem as a constrained nonlinear optimization for which we propose a solution relying on the Frank-Wolfe method. We then prove that the proposed solution converges to a stationary point and demonstrate its efficacy in several planning problems.

READ FULL TEXT

page 7

page 8

research
01/24/2023

NeSIG: A Neuro-Symbolic Method for Learning to Generate Planning Problems

In the field of Automated Planning there is often the need for a set of ...
research
02/27/2023

Robust Robot Planning for Human-Robot Collaboration

In human-robot collaboration, the objectives of the human are often unkn...
research
06/06/2022

Randomized Synthesis for Diversity and Cost Constraints with Control Improvisation

In many synthesis problems, it can be essential to generate implementati...
research
06/13/2012

Speeding Up Planning in Markov Decision Processes via Automatically Constructed Abstractions

In this paper, we consider planning in stochastic shortest path (SSP) pr...
research
03/12/2018

Solving Markov decision processes for network-level post-hazard recovery via simulation optimization and rollout

Computation of optimal recovery decisions for community resilience assur...
research
07/09/2022

Optimal policies for Bayesian olfactory search in turbulent flows

In many practical scenarios, a flying insect must search for the source ...
research
10/21/2022

Efficient Global Planning in Large MDPs via Stochastic Primal-Dual Optimization

We propose a new stochastic primal-dual optimization algorithm for plann...

Please sign up or login with your details

Forgot password? Click here to reset