Scaling up ML-based Black-box Planning with Partial STRIPS Models

07/10/2022
by Matias Greco, et al.

A popular approach to sequential decision-making is simulator-based search guided by Machine Learning (ML) methods such as policy learning. Alternatively, when a full declarative model is available, model-relaxation heuristics can guide the search effectively. In this work, we consider how a practitioner can improve ML-based black-box planning in settings where a complete symbolic model is not available. We show that specifying an incomplete STRIPS model that describes only part of the problem enables the use of relaxation heuristics. Our findings on several planning domains suggest that this is an effective way to improve ML-based black-box planning beyond collecting more data or tuning ML architectures.
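
To make the idea of a partial STRIPS model concrete, the following is a minimal sketch, not the authors' implementation. It assumes a hypothetical key-and-door task in which only two facts (`have_key`, `door_open`) and two actions are modelled symbolically, while everything else (for example, movement) is left to the black-box simulator. It then computes h_max, a standard delete-relaxation heuristic, over that partial model. The paper may use a different relaxation heuristic; the fact names, action names, and unit costs here are all illustrative assumptions.

```python
import math
from dataclasses import dataclass
from typing import Dict, FrozenSet, Iterable


@dataclass(frozen=True)
class Action:
    """A STRIPS action over the (partial) set of modelled facts."""
    name: str
    pre: FrozenSet[str]
    add: FrozenSet[str]
    delete: FrozenSet[str] = frozenset()  # ignored by the delete relaxation
    cost: float = 1.0


def h_max(state: Iterable[str], goal: Iterable[str],
          actions: Iterable[Action]) -> float:
    """h_max heuristic: cost of the most expensive goal fact when
    delete effects are ignored. Returns math.inf if some goal fact
    is unreachable in the relaxed model."""
    actions = list(actions)
    cost: Dict[str, float] = {f: 0.0 for f in state}
    changed = True
    while changed:  # fixpoint iteration; costs only ever decrease
        changed = False
        for a in actions:
            if all(p in cost for p in a.pre):
                reach = max((cost[p] for p in a.pre), default=0.0) + a.cost
                for f in a.add:
                    if reach < cost.get(f, math.inf):
                        cost[f] = reach
                        changed = True
    goal = list(goal)
    if any(g not in cost for g in goal):
        return math.inf
    return max((cost[g] for g in goal), default=0.0)


# Hypothetical partial model: only the key/door logic is declared.
PARTIAL_MODEL = [
    Action("pick_key", pre=frozenset(), add=frozenset({"have_key"})),
    Action("open_door", pre=frozenset({"have_key"}),
           add=frozenset({"door_open"})),
]
GOAL = {"door_open"}

if __name__ == "__main__":
    for facts in (set(), {"have_key"}, {"door_open"}):
        print(sorted(facts), h_max(facts, GOAL, PARTIAL_MODEL))
```

In a black-box search, a value like this could be computed on the modelled facts observed in each simulator state and combined with a learned policy to rank successors. How the two signals are combined is a design choice this sketch leaves open.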


