Monte Carlo Planning in Hybrid Belief POMDPs

11/14/2022
by Moran Barenboim, et al.

Real-world problems often require reasoning about hybrid beliefs, defined over both discrete and continuous random variables. Yet this setting has hardly been investigated in the context of planning. Moreover, existing online Partially Observable Markov Decision Process (POMDP) solvers do not support hybrid beliefs directly. In particular, these solvers do not address the added computational burden caused by the number of hypotheses, which can grow exponentially with the planning horizon. In this work, we present a novel algorithm, Hybrid Belief Monte Carlo Planning (HB-MCP), that uses the Monte Carlo Tree Search (MCTS) algorithm to solve a POMDP while maintaining a hybrid belief. We illustrate how the upper confidence bound (UCB) exploration bonus can be leveraged to guide the growth of hypothesis trees alongside the belief trees. We then evaluate our approach in highly aliased simulated environments where unresolved data association leads to multi-modal belief hypotheses.
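For readers unfamiliar with the UCB exploration bonus mentioned above, the following is a minimal sketch of the standard UCB1 selection rule commonly used in MCTS. It is not the HB-MCP algorithm itself: the function name `ucb1_select`, the `stats` layout, and the exploration constant `c` are illustrative assumptions, and how the paper applies the bonus to hypothesis trees is not specified in this abstract.

```python
import math


def ucb1_select(stats, c=1.0):
    """Pick the child with the highest UCB1 score.

    stats: dict mapping a child label -> (visit_count, mean_value).
    c: exploration constant (hypothetical default, tune per problem).
    """
    total_visits = sum(n for n, _ in stats.values())
    best_child, best_score = None, -math.inf
    for child, (n, mean_value) in stats.items():
        if n == 0:
            # Unvisited children are tried first (infinite bonus).
            return child
        # Exploitation term plus exploration bonus that shrinks with visits.
        score = mean_value + c * math.sqrt(math.log(total_visits) / n)
        if score > best_score:
            best_child, best_score = child, score
    return best_child
```

With `c=0` the rule is purely greedy on the mean value; larger `c` favors rarely visited children, which is the mechanism the abstract refers to when it speaks of guiding tree growth.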

research
05/31/2023

BetaZero: Belief-State Planning for Long-Horizon POMDPs using Learned Approximations

Real-world planning problems, including autonomous driving and sustai...
research
06/09/2023

Combining a Meta-Policy and Monte-Carlo Planning for Scalable Type-Based Reasoning in Partially Observable Environments

The design of autonomous agents that can interact effectively with other...
research
03/21/2021

Monte Carlo Information-Oriented Planning

In this article, we discuss how to solve information-gathering problems ...
research
03/03/2023

Data Association Aware POMDP Planning with Hypothesis Pruning Performance Guarantees

Autonomous agents that operate in the real world must often deal with pa...
research
03/20/2013

A Monte-Carlo Algorithm for Dempster-Shafer Belief

A very computationally-efficient Monte-Carlo algorithm for the calculati...
research
09/28/2019

Dual Sequential Monte Carlo: Tunneling Filtering and Planning in Continuous POMDPs

We present the DualSMC network that solves continuous POMDPs by learning...
research
02/10/2022

D2A-BSP: Distilled Data Association Belief Space Planning with Performance Guarantees Under Budget Constraints

Unresolved data association in ambiguous and perceptually aliased enviro...
