Interpretable Machine Learning for Science with PySR and SymbolicRegression.jl

05/02/2023
by Miles Cranmer, et al.

PySR is an open-source library for practical symbolic regression, a type of machine learning that aims to discover human-interpretable symbolic models. PySR was developed to democratize and popularize symbolic regression for the sciences, and is built on a high-performance distributed backend, a flexible search algorithm, and interfaces with several deep learning packages. Its internal search algorithm is a multi-population evolutionary algorithm built around a unique evolve-simplify-optimize loop, designed for optimizing unknown scalar constants in newly discovered empirical expressions. PySR's backend is the highly optimized Julia library SymbolicRegression.jl, which can also be used directly from Julia. The backend can fuse user-defined operators into SIMD kernels at runtime, perform automatic differentiation, and distribute populations of expressions to thousands of cores across a cluster. In describing this software, we also introduce a new benchmark, "EmpiricalBench," to quantify the applicability of symbolic regression algorithms in science. This benchmark measures recovery of historical empirical equations from original and synthetic datasets.
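To make the evolve-simplify-optimize idea concrete, the following is a minimal toy sketch in pure Python. It is not PySR's implementation and does not use its API: the tuple-based expression encoding, the helper names (`mutate`, `simplify`, `optimize`, `search`), the `MAX_SIZE` cap, the random hill-climbing optimizer, and the replace-the-worst and ring-migration rules are all assumptions chosen for brevity. It only illustrates the general shape of a multi-population loop in which each mutated expression is simplified and then has its constants tuned before competing for a place in its population.

```python
import math
import random

# Expressions are nested tuples: ("x",), ("c", value), or (op, left, right).
OPS = {"+": lambda a, b: a + b, "*": lambda a, b: a * b}
MAX_SIZE = 15  # hypothetical complexity cap, counted in nodes

def evaluate(expr, x):
    if expr[0] == "x":
        return x
    if expr[0] == "c":
        return expr[1]
    return OPS[expr[0]](evaluate(expr[1], x), evaluate(expr[2], x))

def mse(expr, data):
    total = 0.0
    for x, y in data:
        d = evaluate(expr, x) - y
        total += d * d
    # Treat overflow or NaN as an infinitely bad fit.
    return total / len(data) if math.isfinite(total) else math.inf

def size(expr):
    return 1 if expr[0] not in OPS else 1 + size(expr[1]) + size(expr[2])

def random_leaf(rng):
    return ("x",) if rng.random() < 0.5 else ("c", rng.uniform(-2.0, 2.0))

def mutate(expr, rng):
    """Evolve step: change a random subtree or grow a new operator node."""
    if expr[0] in OPS and rng.random() < 0.7:
        parts = list(expr)
        i = rng.choice((1, 2))
        parts[i] = mutate(expr[i], rng)
        return tuple(parts)
    if rng.random() < 0.5:
        return random_leaf(rng)
    return (rng.choice(list(OPS)), expr, random_leaf(rng))

def simplify(expr):
    """Simplify step: fold operator nodes whose children are both constants."""
    if expr[0] not in OPS:
        return expr
    left, right = simplify(expr[1]), simplify(expr[2])
    if left[0] == "c" and right[0] == "c":
        return ("c", OPS[expr[0]](left[1], right[1]))
    return (expr[0], left, right)

def get_constants(expr):
    if expr[0] == "c":
        return [expr[1]]
    if expr[0] == "x":
        return []
    return get_constants(expr[1]) + get_constants(expr[2])

def set_constants(expr, values):
    """Rebuild expr, consuming `values` (an iterator) in left-to-right order."""
    if expr[0] == "c":
        return ("c", next(values))
    if expr[0] == "x":
        return expr
    left = set_constants(expr[1], values)
    right = set_constants(expr[2], values)
    return (expr[0], left, right)

def optimize(expr, data, rng, steps=10):
    """Optimize step: random hill-climbing on constants, keeping improvements."""
    best, best_loss = expr, mse(expr, data)
    if not get_constants(best):
        return best
    for _ in range(steps):
        trial = [c + rng.gauss(0.0, 0.3) for c in get_constants(best)]
        cand = set_constants(best, iter(trial))
        cand_loss = mse(cand, data)
        if cand_loss < best_loss:
            best, best_loss = cand, cand_loss
    return best

def search(data, n_pops=3, pop_size=20, generations=20, seed=0):
    rng = random.Random(seed)
    fitness = lambda e: mse(e, data)
    pops = [[random_leaf(rng) for _ in range(pop_size)] for _ in range(n_pops)]

    def insert(pop, cand):
        # Replace the worst member, but only if the candidate is better.
        worst = max(range(len(pop)), key=lambda i: fitness(pop[i]))
        if fitness(cand) < fitness(pop[worst]):
            pop[worst] = cand

    for _ in range(generations):
        for pop in pops:
            for _ in range(pop_size):
                # Tournament: the fitter of two random members is the parent.
                parent = min(rng.sample(pop, 2), key=fitness)
                child = mutate(parent, rng)
                if size(child) <= MAX_SIZE:
                    insert(pop, optimize(simplify(child), data, rng))
        # Migration: each population offers its best member to its neighbor.
        for i, pop in enumerate(pops):
            insert(pops[(i + 1) % n_pops], min(pop, key=fitness))
    best = min((e for pop in pops for e in pop), key=fitness)
    return best, fitness(best)

if __name__ == "__main__":
    # Synthetic data from y = 2x + 1 on x in [-2, 2].
    data = [(k / 2, 2 * (k / 2) + 1) for k in range(-4, 5)]
    print(search(data))
```

Running constant optimization inside the loop means an expression with the right structure is not discarded merely because its initial constants were poor, which is the motivation the abstract gives for the design. For real work, the library itself is the appropriate tool; it exposes a scikit-learn-style `PySRRegressor` class in Python.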


