Symbolic Regression via Control Variable Genetic Programming

05/25/2023
by   Nan Jiang, et al.
0

Learning symbolic expressions directly from experiment data is a vital step in AI-driven scientific discovery. Nevertheless, state-of-the-art approaches are limited to learning simple expressions. Regressing expressions involving many independent variables still remain out of reach. Motivated by the control variable experiments widely utilized in science, we propose Control Variable Genetic Programming (CVGP) for symbolic regression over many independent variables. CVGP expedites symbolic expression discovery via customized experiment design, rather than learning from a fixed dataset collected a priori. CVGP starts by fitting simple expressions involving a small set of independent variables using genetic programming, under controlled experiments where other variables are held as constants. It then extends expressions learned in previous generations by adding new independent variables, using new control variable experiments in which these variables are allowed to vary. Theoretically, we show CVGP as an incremental building approach can yield an exponential reduction in the search space when learning a class of expressions. Experimentally, CVGP outperforms several baselines in learning symbolic expressions involving multiple independent variables.

READ FULL TEXT
research
09/13/2023

Racing Control Variable Genetic Programming for Symbolic Regression

Symbolic regression, as one of the most crucial tasks in AI for science,...
research
12/10/2019

Deep symbolic regression: Recovering mathematical expressions from data via policy gradients

Discovering the underlying mathematical expressions describing a dataset...
research
12/07/2020

Estimation of Gas Turbine Shaft Torque and Fuel Flow of a CODLAG Propulsion System Using Genetic Programming Algorithm

In this paper, the publicly available dataset of condition based mainten...
research
06/07/2023

Neural Symbolic Regression using Control Variables

Symbolic regression (SR) is a powerful technique for discovering the ana...
research
09/23/2013

Data Mining using Unguided Symbolic Regression on a Blast Furnace Dataset

In this paper a data mining approach for variable selection and knowledg...
research
01/04/2018

A Greedy Search Tree Heuristic for Symbolic Regression

Symbolic Regression tries to find a mathematical expression that describ...
research
01/23/2019

Neural-Guided Symbolic Regression with Semantic Prior

Symbolic regression has been shown to be quite useful in many domains fr...

Please sign up or login with your details

Forgot password? Click here to reset