Utilizing Admissible Bounds for Heuristic Learning

08/23/2023
by   Carlos Núñez-Molina, et al.
0

While learning a heuristic function for forward search algorithms with modern machine learning techniques has been gaining interest in recent years, there has been little theoretical understanding of what they should learn, how to train them, and why we do so. This lack of understanding leads to various literature performing an ad-hoc selection of datasets (suboptimal vs optimal costs or admissible vs inadmissible heuristics) and optimization metrics (e.g., squared vs absolute errors). Moreover, due to the lack of admissibility of the resulting trained heuristics, little focus has been put on the role of admissibility during learning. This paper articulates the role of admissible heuristics in supervised heuristic learning using them as parameters of Truncated Gaussian distributions, which tightens the hypothesis space compared to ordinary Gaussian distributions. We argue that this mathematical model faithfully follows the principle of maximum entropy and empirically show that, as a result, it yields more accurate heuristics and converges faster during training.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/03/2016

Learning to Rank for Synthesizing Planning Heuristics

We investigate learning heuristics for domain-specific planning. Prior w...
research
07/08/2013

Inconsistency and Accuracy of Heuristics with A* Search

Many studies in heuristic search suggest that the accuracy of the heuris...
research
09/28/2017

Deep Learning Assisted Heuristic Tree Search for the Container Pre-marshalling Problem

One of the key challenges for operations researchers solving real-world ...
research
03/16/2023

Learning Local Heuristics for Search-Based Navigation Planning

Graph search planning algorithms for navigation typically rely heavily o...
research
11/24/2014

Rational Deployment of Multiple Heuristics in IDA*

Recent advances in metareasoning for search has shown its usefulness in ...
research
03/29/2023

Training Feedforward Neural Networks with Bayesian Hyper-Heuristics

The process of training feedforward neural networks (FFNNs) can benefit ...
research
06/20/2019

ID3 Learns Juntas for Smoothed Product Distributions

In recent years, there are many attempts to understand popular heuristic...

Please sign up or login with your details

Forgot password? Click here to reset