Learning to Plan via Neural Exploration-Exploitation Trees

02/28/2019
by   Binghong Chen, et al.
0

Sampling-based algorithms such as RRT and its variants are powerful tools for path planning problems in high-dimensional continuous state and action spaces. While these algorithms perform systematic exploration of the state space, they do not fully exploit past planning experiences from similar environments. In this paper, we design a meta path planning algorithm, called Neural Exploration-Exploitation Trees (NEXT), which can exploit past experience to drastically reduce the sample requirement for solving new path planning problems. More specifically, NEXT contains a novel neural architecture which can learn from experiences the dependency between task structures and promising path search directions. Then this learned prior is integrated with a UCB-type algorithm to achieve an online balance between exploration and exploitation when solving a new problem. Empirically, we show that NEXT can complete the planning tasks with very small searching trees and significantly outperforms previous state-of-the-arts on several benchmark problems.

READ FULL TEXT

page 13

page 20

page 21

page 22

research
12/01/2009

A Multi-stage Probabilistic Algorithm for Dynamic Path-Planning

Probabilistic sampling methods have become very popular to solve single-...
research
02/28/2021

Path Planning for Manipulation using Experience-driven Random Trees

Robotic systems may frequently come across similar manipulation planning...
research
09/06/2016

Q-Learning with Basic Emotions

Q-learning is a simple and powerful tool in solving dynamic problems whe...
research
06/19/2021

Learning Space Partitions for Path Planning

Path planning, the problem of efficiently discovering high-reward trajec...
research
06/10/2023

Contribution à l'Optimisation d'un Comportement Collectif pour un Groupe de Robots Autonomes

This thesis studies the domain of collective robotics, and more particul...
research
08/17/2015

Reasoning in complex environments with the SelectScript declarative language

SelectScript is an extendable, adaptable, and declarative domain-specifi...
research
03/04/2021

Estimation and Planning of Exploration Over Grid Map Using A Spatiotemporal Model with Incomplete State Observations

Path planning over spatiotemporal models can be applied to a variety of ...

Please sign up or login with your details

Forgot password? Click here to reset