A Greedy Search Tree Heuristic for Symbolic Regression

Symbolic Regression tries to find a mathematical expression that describes the relationship of a set of explanatory variables to a measured variable. The main objective is to find a model that minimizes the error and, optionally, that also minimizes the expression size. A smaller expression can be seen as an interpretable model considered a reliable decision model. This is often performed with Genetic Programming which represents their solution as expression trees. The shortcoming of this algorithm lies on this representation that defines a rugged search space and contains expressions of any size and difficulty. These pose as a challenge to find the optimal solution under computational constraints. This paper introduces a new data structure, called Interaction-Transformation (IT), that constrains the search space in order to exclude a region of larger and more complicated expressions. In order to test this data structure, it was also introduced an heuristic called SymTree. The obtained results show evidence that SymTree are capable of obtaining the optimal solution whenever the target function is within the search space of the IT data structure and competitive results when it is not. Overall, the algorithm found a good compromise between accuracy and simplicity for all the generated models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/11/2019

Interaction-Transformation Evolutionary Algorithm for Symbolic Regression

The Interaction-Transformation (IT) is a new representation for Symbolic...
research
10/20/2019

Bayesian Symbolic Regression

Interpretability is crucial for machine learning in many scenarios such ...
research
09/28/2021

Cluster Analysis of a Symbolic Regression Search Space

In this chapter we take a closer look at the distribution of symbolic re...
research
04/25/2022

Transformation-Interaction-Rational Representation for Symbolic Regression

Symbolic Regression searches for a function form that approximates a dat...
research
05/25/2023

Symbolic Regression via Control Variable Genetic Programming

Learning symbolic expressions directly from experiment data is a vital s...
research
05/22/2017

Block building programming for symbolic regression

Symbolic regression that aims to detect underlying data-driven models ha...
research
09/02/2022

Analysis of a Greedy Heuristic for the Labeling of a Map with a Time-Window Interface

In this paper, we analyze the approximation quality of a greedy heuristi...

Please sign up or login with your details

Forgot password? Click here to reset