Symbolic Regression by Exhaustive Search: Reducing the Search Space Using Syntactical Constraints and Efficient Semantic Structure Deduplication

09/28/2021
by   Lukas Kammerer, et al.

Symbolic regression is a powerful system identification technique for industrial scenarios in which no prior knowledge about the model structure is available. Such scenarios often require specific model properties such as interpretability, robustness, trustworthiness and plausibility, which are not easily achieved with standard approaches such as genetic programming for symbolic regression. In this chapter we introduce a deterministic symbolic regression algorithm specifically designed to address these issues. The algorithm uses a context-free grammar to produce models whose parameters are fitted by a non-linear least squares local optimization procedure. Structural restrictions, together with a caching mechanism that detects semantically equivalent solutions, guarantee that the enumeration of all possible models is finite. The enumeration order is determined by heuristics designed to improve search efficiency. Empirical tests on a comprehensive benchmark suite show that our approach is competitive with genetic programming on many noiseless problems while maintaining desirable properties such as simple, reliable models and reproducibility.
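To make the idea of a finite, deduplicated enumeration concrete, the following is a minimal sketch and not the authors' implementation. It assumes a toy grammar `Expr -> Var | (Expr op Expr)` with two variables and two commutative operators, uses a depth limit as the structural restriction, and uses a set of canonical forms as the cache. The names `canonical`, `enumerate_expressions` and `deduplicate` are hypothetical, and parameter fitting by non-linear least squares is left out.

```python
from itertools import product

# Assumed toy grammar: Expr -> Var | (Expr op Expr)
VARIABLES = ("x", "y")
COMMUTATIVE_OPS = ("+", "*")


def canonical(expr):
    """Return a canonical form by sorting operands of commutative operators."""
    if isinstance(expr, str):
        return expr
    op, left, right = expr
    left, right = canonical(left), canonical(right)
    if op in COMMUTATIVE_OPS:
        # repr gives a total order over both strings and nested tuples.
        left, right = sorted((left, right), key=repr)
    return (op, left, right)


def enumerate_expressions(max_depth):
    """Enumerate every expression tree of the toy grammar up to max_depth.

    The depth limit is the structural restriction that keeps the set finite.
    """
    if max_depth == 0:
        return list(VARIABLES)
    smaller = enumerate_expressions(max_depth - 1)
    result = list(VARIABLES)
    for op, left, right in product(COMMUTATIVE_OPS, smaller, smaller):
        result.append((op, left, right))
    return result


def deduplicate(expressions):
    """Keep one representative per canonical form, using a set as the cache."""
    seen = set()
    unique = []
    for expr in expressions:
        key = canonical(expr)
        if key not in seen:
            seen.add(key)
            unique.append(key)
    return unique


all_exprs = enumerate_expressions(1)
unique_exprs = deduplicate(all_exprs)
print(len(all_exprs), len(unique_exprs))
```

In this sketch, `x + y` and `y + x` map to the same canonical form, so only one of them would be passed on to parameter fitting. Sorting commutative operands catches only a small class of equivalences; the caching mechanism described in the chapter may detect more.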


research
09/28/2021

Cluster Analysis of a Symbolic Regression Search Space

In this chapter we take a closer look at the distribution of symbolic re...
research
04/03/2017

A Probabilistic Linear Genetic Programming with Stochastic Context-Free Grammar for solving Symbolic Regression problems

Traditional Linear Genetic Programming (LGP) algorithms are based only o...
research
07/22/2021

Hash-Based Tree Similarity and Simplification in Genetic Programming for Symbolic Regression

We introduce in this paper a runtime-efficient tree hashing algorithm fo...
research
04/24/2020

Symbolic Regression Driven by Training Data and Prior Knowledge

In symbolic regression, the search for analytic models is typically driv...
research
09/20/2022

Symbolic Regression with Fast Function Extraction and Nonlinear Least Squares Optimization

Fast Function Extraction (FFX) is a deterministic algorithm for solving ...
research
04/17/2017

Interval Arithmetic and Interval-Aware Operators for Genetic Programming

Symbolic regression via genetic programming is a flexible approach to ma...
research
05/22/2017

Block building programming for symbolic regression

Symbolic regression that aims to detect underlying data-driven models ha...
