Dual Sequential Monte Carlo: Tunneling Filtering and Planning in Continuous POMDPs

09/28/2019
by Yunbo Wang, et al.

We present the DualSMC network, which solves continuous POMDPs by learning belief representations and then leveraging them for planning. It builds on the observation that filtering (i.e., state estimation) and planning can be viewed as two related sequential Monte Carlo processes, one in the belief space and the other in the space of future planning trajectories. In particular, we first introduce a novel particle filter network that makes better use of the adversarial relationship between the proposer model and the observation model. We then introduce a new planning algorithm over the belief representations, which learns uncertainty-dependent policies. The two parts can be trained jointly. We demonstrate the effectiveness of our approach on three continuous control and planning tasks: floor positioning, 3D light-dark navigation, and a modified Reacher task.
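
To make the filtering half of this picture concrete, the sketch below shows one update of a plain bootstrap particle filter on a one-dimensional state. It is not the DualSMC network: the learned proposer and observation models of the paper are replaced here by hand-written Gaussian models, and the names `particle_filter_step`, `belief_mean`, `motion_noise`, and `obs_noise` are hypothetical, chosen only for illustration.

```python
import math
import random


def particle_filter_step(particles, weights, action, observation,
                         motion_noise=0.1, obs_noise=0.5, rng=random):
    """One bootstrap particle filter update on a 1-D state (illustrative only)."""
    n = len(particles)
    # 1. Resample particles in proportion to their current weights.
    resampled = rng.choices(particles, weights=weights, k=n)
    # 2. Propagate each particle through an assumed additive-noise motion model.
    moved = [x + action + rng.gauss(0.0, motion_noise) for x in resampled]
    # 3. Reweight with an assumed Gaussian observation likelihood p(o | x).
    new_weights = [math.exp(-0.5 * ((observation - x) / obs_noise) ** 2)
                   for x in moved]
    total = sum(new_weights)
    if total == 0.0:
        # All likelihoods underflowed; fall back to uniform weights.
        new_weights = [1.0 / n] * n
    else:
        new_weights = [w / total for w in new_weights]
    return moved, new_weights


def belief_mean(particles, weights):
    """Weighted mean of the particle set, a simple summary of the belief."""
    return sum(x * w for x, w in zip(particles, weights))
```

The weighted particle set returned by each step is the belief representation; a planner that rolls out sampled future trajectories from these particles would be the second sequential Monte Carlo process the abstract refers to.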


research
10/05/2021

Deep Synoptic Monte Carlo Planning in Reconnaissance Blind Chess

This paper introduces deep synoptic Monte Carlo planning (DSMCP) for lar...
research
11/14/2022

Monte Carlo Planning in Hybrid Belief POMDPs

Real-world problems often require reasoning about hybrid beliefs, over b...
research
11/04/2020

An On-Line POMDP Solver for Continuous Observation Spaces

Planning under partial observability is essential for autonomous robots. ...
research
01/23/2013

Approximate Planning for Factored POMDPs using Belief State Simplification

We are interested in the problem of planning for factored POMDPs. Buildi...
research
02/26/2020

SACBP: Belief Space Planning for Continuous-Time Dynamical Systems via Stochastic Sequential Action Control

We propose a novel belief space planning technique for continuous dynami...
research
04/20/2023

Filter-Aware Model-Predictive Control

Partially-observable problems pose a trade-off between reducing costs an...
research
09/23/2022

involve-MI: Informative Planning with High-Dimensional Non-Parametric Beliefs

One of the most complex tasks of decision making and planning is to gath...
