Accelerating Understanding of Scientific Experiments with End to End Symbolic Regression

12/07/2021
by Nikos Arechiga, et al.

We consider the problem of learning free-form symbolic expressions from raw data, such as that produced by an experiment in any scientific domain. Accurate and interpretable models of scientific phenomena are the cornerstone of scientific research. Simple, interpretable models, such as linear or logistic regression and decision trees, often lack predictive accuracy. Conversely, black-box models such as deep neural networks provide high predictive accuracy but do not readily admit human understanding in a way that would enrich the scientific theory of the phenomenon. Many great breakthroughs in science revolve around the development of parsimonious equational models with high predictive accuracy, such as Newton's laws, universal gravitation, and Maxwell's equations. Previous work on automating the search for equational models from data combines domain-specific heuristics with computationally expensive techniques, such as genetic programming and Monte-Carlo search. We develop a deep neural network (MACSYMA) that treats symbolic regression as an end-to-end supervised learning problem. MACSYMA can generate symbolic expressions that describe a dataset, and the computational cost of generating an expression is reduced to the feedforward computation of a neural network. We train our neural network on a synthetic dataset consisting of data tables of varying length and varying levels of noise, for which the neural network must learn to produce the correct symbolic expression token by token. Finally, we validate our technique by applying it to a public dataset from behavioral science.
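To make the training setup concrete, the following is a minimal sketch of how one synthetic training example of the kind described above might be generated: a random expression, a data table of random length sampled from it, and Gaussian noise of a random level. It is an illustration only, not the authors' pipeline. The token vocabulary, the prefix-notation encoding, the sampling ranges, and every function name (`random_expression`, `evaluate`, `make_example`) are assumptions, since the abstract does not specify them.

```python
import math
import random

# Hypothetical vocabulary: the abstract does not list the actual token set.
BINARY = {"add": lambda a, b: a + b, "mul": lambda a, b: a * b}
UNARY = {"sin": math.sin, "cos": math.cos}
LEAVES = ["x", "1", "2"]


def random_expression(rng, depth):
    """Return a random expression as a list of prefix-notation tokens."""
    if depth == 0 or rng.random() < 0.3:
        return [rng.choice(LEAVES)]
    if rng.random() < 0.5:
        op = rng.choice(sorted(UNARY))
        return [op] + random_expression(rng, depth - 1)
    op = rng.choice(sorted(BINARY))
    return [op] + random_expression(rng, depth - 1) + random_expression(rng, depth - 1)


def evaluate(tokens, x):
    """Evaluate a prefix-notation token list at the point x."""
    def walk(pos):
        # Returns (value of the subexpression starting at pos, next position).
        tok = tokens[pos]
        if tok in BINARY:
            left, pos = walk(pos + 1)
            right, pos = walk(pos)
            return BINARY[tok](left, right), pos
        if tok in UNARY:
            arg, pos = walk(pos + 1)
            return UNARY[tok](arg), pos
        return (x if tok == "x" else float(tok)), pos + 1

    value, end = walk(0)
    assert end == len(tokens), "malformed token sequence"
    return value


def make_example(rng, max_depth=3, max_rows=20, max_noise=0.1):
    """Build one (data table, target tokens) pair for supervised training."""
    tokens = random_expression(rng, max_depth)
    n_rows = rng.randint(5, max_rows)       # varying table length
    sigma = rng.uniform(0.0, max_noise)     # varying noise level
    table = []
    for _ in range(n_rows):
        x = rng.uniform(-2.0, 2.0)
        y = evaluate(tokens, x) + rng.gauss(0.0, sigma)
        table.append((x, y))
    return table, tokens


if __name__ == "__main__":
    table, tokens = make_example(random.Random(0))
    print("target tokens:", tokens)
    print("first rows:", table[:3])
```

In this sketch the table is the model input and the token list is the supervised target, which a sequence model would be trained to emit one token at a time.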

research
07/01/2022

Deep Learning and Symbolic Regression for Discovering Parametric Equations

Symbolic regression is a machine learning technique that can learn the g...
research
12/10/2019

Integration of Neural Network-Based Symbolic Regression in Deep Learning for Scientific Discovery

Symbolic regression is a powerful technique that can discover analytical...
research
02/22/2023

Deep Generative Symbolic Regression with Monte-Carlo-Tree-Search

Symbolic regression (SR) is the problem of learning a symbolic expressio...
research
04/25/2017

Learning of Human-like Algebraic Reasoning Using Deep Feedforward Neural Networks

There is a wide gap between symbolic reasoning and deep learning. In thi...
research
10/10/2022

Neurosymbolic Programming for Science

Neurosymbolic Programming (NP) techniques have the potential to accelera...
research
10/30/2017

Contextual Regression: An Accurate and Conveniently Interpretable Nonlinear Model for Mining Discovery from Scientific Data

Machine learning algorithms such as linear regression, SVM and neural ne...
research
04/03/2023

Interpretable Symbolic Regression for Data Science: Analysis of the 2022 Competition

Symbolic regression searches for analytic expressions that accurately de...
