A general methodology to assess symbolic regression algorithms using the generation of random equations with uniform random sampling

06/18/2019
by   Sohrab Towfighi, et al.
0

Symbolic regression is the act of determining the ideal equation to fit a given dataset. Symbolic regression problems are typically solved using genetic algorithms. Being a metaheuristic approach to global optimization, genetic algorithms were previously conceived as a panacea solution to most computational problems. The paper presents a methodology to compare symbolic regression algorithms. The combinatorics of the problem space is explored and a novel method is described that allows users to count the number of possible equations in a defined problem space. The generation of full binary trees is discussed using a little known but remarkably simple dense enumeration which maps integers to unique binary trees. Though the set of all possible equations is infinite, the total number of equations is finite and specified once we limit our search to N binary trees, n functions, and m terminals. We provide a methodology to do uniform random sampling from this large but finite set of equations. We examine whether a simple evolutionary algorithm outperforms random search using thousands of randomly generated experiments and leverage arguments from elementary statistics. The methodology is generalizable and can be applied to compare symbolic regression algorithms.

READ FULL TEXT
research
06/18/2019

Symbolic regression by random search

Purpose: To compare symbolic regression by genetic programming (SRGP) wi...
research
06/20/2011

Dimensionally Constrained Symbolic Regression

We describe dimensionally constrained symbolic regression which has been...
research
06/25/2022

Binary and Multinomial Classification through Evolutionary Symbolic Regression

We present three evolutionary symbolic regression-based classification a...
research
02/20/2023

Efficient Generator of Mathematical Expressions for Symbolic Regression

We propose an approach to symbolic regression based on a novel variation...
research
05/24/2021

A Flawed Dataset for Symbolic Equation Verification

Arabshahi, Singh, and Anandkumar (2018) propose a method for creating a ...
research
06/13/2022

Symbolic Regression for Space Applications: Differentiable Cartesian Genetic Programming Powered by Multi-objective Memetic Algorithms

Interpretable regression models are important for many application domai...
research
06/14/2023

Probabilistic Regular Tree Priors for Scientific Symbolic Reasoning

Symbolic Regression (SR) allows for the discovery of scientific equation...

Please sign up or login with your details

Forgot password? Click here to reset