Combining Offline Models and Online Monte-Carlo Tree Search for Planning from Scratch

04/05/2019
by Yunlong Liu, et al.

Planning in stochastic and partially observable environments is a central issue in artificial intelligence. One commonly used technique for solving such problems is to first construct an accurate model of the environment. Although some recent approaches learn optimal behaviour under model uncertainty, they still need prior knowledge about the environment to guarantee their performance. In this paper, we exploit the strengths of Predictive State Representations (PSRs) for state representation and model prediction to introduce an approach for planning from scratch: a PSR model is first learned offline and then combined with online Monte-Carlo tree search to plan under model uncertainty. We demonstrate the effectiveness of the proposed approaches by comparing them with the state-of-the-art approach to planning with model uncertainty, and we provide a proof of their convergence. We also test the effectiveness and scalability of our proposed approach on the RockSample problem, which is infeasible for the state-of-the-art BA-POMDP-based approaches.
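The two-phase idea in the abstract (learn a model offline, then search online inside that model) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and not the paper's method: the corridor environment, the function names, and all parameter values are hypothetical; a simple count table over a fully observable state stands in for the PSR model; and UCB1 action selection at the root with random rollouts stands in for a full Monte-Carlo tree search.

```python
import math
import random
from collections import defaultdict

# Hypothetical toy environment (not from the paper): a 5-cell corridor.
# Actions: 0 = left, 1 = right. A move is reversed with probability SLIP.
# Reaching the last cell gives reward 1 and ends the episode.
GOAL, SLIP = 4, 0.2


def env_step(state, action, rng):
    move = 1 if action == 1 else -1
    if rng.random() < SLIP:
        move = -move
    nxt = min(max(state + move, 0), GOAL)
    return nxt, (1.0 if nxt == GOAL else 0.0), nxt == GOAL


def learn_offline_model(n_steps, rng):
    """Offline phase: explore at random and count outcomes per (state, action).

    The count table is a stand-in for the PSR model named in the abstract.
    """
    counts = defaultdict(lambda: defaultdict(int))
    state = 0
    for _ in range(n_steps):
        action = rng.randrange(2)
        nxt, reward, done = env_step(state, action, rng)
        counts[(state, action)][(nxt, reward, done)] += 1
        state = 0 if done else nxt
    return counts


def model_step(model, state, action, rng):
    """Sample an outcome from the learned model, not from the real environment."""
    outcomes = model.get((state, action))
    if not outcomes:  # unseen pair: assume the state does not change
        return state, 0.0, False
    keys = list(outcomes)
    weights = [outcomes[k] for k in keys]
    return rng.choices(keys, weights=weights)[0]


def rollout(model, state, depth, gamma, rng):
    """Discounted return of a uniformly random policy simulated in the model."""
    total, discount = 0.0, 1.0
    for _ in range(depth):
        state, reward, done = model_step(model, state, rng.randrange(2), rng)
        total += discount * reward
        discount *= gamma
        if done:
            break
    return total


def plan(model, state, n_sims, rng, depth=20, gamma=0.95, c=1.0):
    """Online phase: UCB1 over the root actions, evaluated by model rollouts."""
    visits = [0, 0]
    value = [0.0, 0.0]
    for t in range(1, n_sims + 1):
        if 0 in visits:  # try each action once before applying UCB1
            a = visits.index(0)
        else:
            a = max(
                range(2),
                key=lambda i: value[i] + c * math.sqrt(math.log(t) / visits[i]),
            )
        nxt, reward, done = model_step(model, state, a, rng)
        ret = reward
        if not done:
            ret += gamma * rollout(model, nxt, depth, gamma, rng)
        visits[a] += 1
        value[a] += (ret - value[a]) / visits[a]  # incremental mean
    return max(range(2), key=lambda i: value[i])


rng = random.Random(0)
model = learn_offline_model(5000, rng)
best_action = plan(model, state=3, n_sims=500, rng=rng)
```

Here the planner never queries `env_step` during search; it only samples from the table built offline, which is the separation between the offline and online phases that the abstract describes. A partially observable version would replace the state key with a representation computed from the action-observation history.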


