Learning Heuristic Search via Imitation

07/10/2017
by   Mohak Bhardwaj, et al.
0

Robotic motion planning problems are typically solved by constructing a search tree of valid maneuvers from a start to a goal configuration. Limited onboard computation and real-time planning constraints impose a limit on how large this search tree can grow. Heuristics play a crucial role in such situations by guiding the search towards potentially good directions and consequently minimizing search effort. Moreover, it must infer such directions in an efficient manner using only the information uncovered by the search up until that time. However, state of the art methods do not address the problem of computing a heuristic that explicitly minimizes search effort. In this paper, we do so by training a heuristic policy that maps the partial information from the search to decide which node of the search tree to expand. Unfortunately, naively training such policies leads to slow convergence and poor local minima. We present SaIL, an efficient algorithm that trains heuristic policies by imitating "clairvoyant oracles" - oracles that have full information about the world and demonstrate decisions that minimize search effort. We leverage the fact that such oracles can be efficiently computed using dynamic programming and derive performance guarantees for the learnt heuristic. We validate the approach on a spectrum of environments which show that SaIL consistently outperforms state of the art algorithms. Our approach paves the way forward for learning heuristics that demonstrate an anytime nature - finding feasible solutions quickly and incrementally refining it over time.

READ FULL TEXT

page 2

page 7

research
11/17/2017

Data-driven Planning via Imitation Learning

Robot planning is the process of selecting a sequence of actions that op...
research
04/04/2019

Generalized Lazy Search for Robot Motion Planning: Interleaving Search and Edge Evaluation via Event-based Toggles

Lazy search algorithms can efficiently solve problems where edge evaluat...
research
09/28/2017

Deep Learning Assisted Heuristic Tree Search for the Container Pre-marshalling Problem

One of the key challenges for operations researchers solving real-world ...
research
07/16/2019

Leveraging Experience in Lazy Search

Lazy graph search algorithms are efficient at solving motion planning pr...
research
10/28/2020

Bidirectional Sampling Based Search Without Two Point Boundary Value Solution

Bidirectional path and motion planning approaches decrease planning time...
research
05/17/2022

Effort Informed Roadmaps (EIRM*): Efficient Asymptotically Optimal Multiquery Planning by Actively Reusing Validation Effort

Multiquery planning algorithms find paths between various different star...
research
04/11/2011

Rational Deployment of CSP Heuristics

Heuristics are crucial tools in decreasing search effort in varied field...

Please sign up or login with your details

Forgot password? Click here to reset