A Differentiable Loss Function for Learning Heuristics in A*

09/12/2022
by   Leah Chrestien, et al.
0

Optimization of heuristic functions for the A* algorithm, realized by deep neural networks, is usually done by minimizing square root loss of estimate of the cost to goal values. This paper argues that this does not necessarily lead to a faster search of A* algorithm since its execution relies on relative values instead of absolute ones. As a mitigation, we propose a L* loss, which upper-bounds the number of excessively expanded states inside the A* search. The L* loss, when used in the optimization of state-of-the-art deep neural networks for automated planning in maze domains like Sokoban and maze with teleports, significantly improves the fraction of solved problems, the quality of founded plans, and reduces the number of expanded states to approximately 50

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/27/2013

Integrating Planning and Execution in Stochastic Domains

We investigate planning in time-critical domains represented as Markov D...
research
03/21/2021

Policy-Guided Heuristic Search with Guarantees

The use of a policy and a heuristic function for guiding search can be q...
research
02/08/2021

A* Search Without Expansions: Learning Heuristic Functions with Deep Q-Networks

A* search is an informed search algorithm that uses a heuristic function...
research
09/30/2018

Nth Absolute Root Mean Error

Neural network training process takes long time when the size of trainin...
research
08/04/2020

Analyzing Upper Bounds on Mean Absolute Errors for Deep Neural Network Based Vector-to-Vector Regression

In this paper, we show that, in vector-to-vector regression utilizing de...
research
11/23/2022

Understanding Sample Generation Strategies for Learning Heuristic Functions in Classical Planning

We study the problem of learning good heuristic functions for classical ...
research
05/26/2023

Levin Tree Search with Context Models

Levin Tree Search (LTS) is a search algorithm that makes use of a policy...

Please sign up or login with your details

Forgot password? Click here to reset