AI Hilbert: From Data and Background Knowledge to Automated Scientific Discovery

08/18/2023
by Ryan Cory-Wright, et al.

The discovery of scientific formulae that parsimoniously explain natural phenomena and align with existing background theory is a key goal in science. Historically, scientists have derived natural laws by manipulating equations based on existing knowledge, forming new equations, and verifying them experimentally. In recent years, data-driven scientific discovery has emerged as a viable competitor in settings with large amounts of experimental data. Unfortunately, data-driven methods often fail to discover valid laws when data are noisy or scarce. Accordingly, recent works combine regression and reasoning to eliminate formulae inconsistent with background theory. However, the problem of searching over the space of formulae consistent with background theory to find the one that best fits the data has not been well solved. We propose a solution to this problem when all axioms and scientific laws are expressible via polynomial equalities and inequalities, and we argue that our approach is widely applicable. We further model notions of minimal complexity using binary variables and logical constraints, solve polynomial optimization problems via mixed-integer linear or semidefinite optimization, and automatically prove the validity of our scientific discoveries via Positivstellensatz certificates. Remarkably, the optimization techniques leveraged in this paper allow our approach to run in polynomial time with fully correct background theory, or in non-deterministic polynomial (NP) time with partially correct background theory. We experimentally demonstrate that some famous scientific laws, including Kepler's Third Law of Planetary Motion, the Hagen-Poiseuille Equation, and the Radiated Gravitational Wave Power equation, can be automatically derived from sets of partially correct background axioms.
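To make the idea of a certificate concrete, the sketch below checks a toy, equality-only case: a candidate law q = 0 follows from axioms h1 = 0 and h2 = 0 whenever polynomial multipliers alpha1, alpha2 exist with q = alpha1*h1 + alpha2*h2. The axioms (F = m*a and a*r = v^2 for circular motion), the candidate law (F*r = m*v^2), the hand-picked multipliers, and all helper names (`var`, `add`, `mul`) are illustrative assumptions and do not come from the paper. The abstract describes finding such certificates by optimization and also handling inequalities; this sketch only verifies a certificate that is already given.

```python
from collections import defaultdict

# Polynomials are dicts: exponent tuple over VARS -> integer coefficient.
VARS = ("F", "m", "a", "r", "v")

def var(name):
    """Return the polynomial consisting of a single variable."""
    return {tuple(1 if x == name else 0 for x in VARS): 1}

def add(p, q, sign=1):
    """Return p + sign*q, dropping monomials whose coefficient is zero."""
    out = defaultdict(int)
    for mono, c in p.items():
        out[mono] += c
    for mono, c in q.items():
        out[mono] += sign * c
    return {mono: c for mono, c in out.items() if c != 0}

def mul(p, q):
    """Return the product p*q by multiplying monomials pairwise."""
    out = defaultdict(int)
    for mono1, c1 in p.items():
        for mono2, c2 in q.items():
            mono = tuple(e1 + e2 for e1, e2 in zip(mono1, mono2))
            out[mono] += c1 * c2
    return {mono: c for mono, c in out.items() if c != 0}

F, m, a, r, v = (var(n) for n in VARS)

# Assumed toy axioms, each written as "polynomial = 0".
h1 = add(F, mul(m, a), sign=-1)          # F - m*a
h2 = add(mul(a, r), mul(v, v), sign=-1)  # a*r - v^2

# Candidate law to certify: F*r - m*v^2 = 0.
q = add(mul(F, r), mul(m, mul(v, v)), sign=-1)

# Hand-picked multipliers (hypothetical; a solver would search for these).
alpha1, alpha2 = r, m
combination = add(mul(alpha1, h1), mul(alpha2, h2))

# The certificate holds if the combination equals q term by term.
certificate_holds = combination == q
print(certificate_holds)
```

Because the identity is checked symbolically rather than at sample points, agreement of the two dictionaries shows that q vanishes wherever both axioms hold.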

research
09/03/2021

Integration of Data and Theory for Accelerated Derivable Symbolic Discovery

Scientists have long aimed to discover meaningful equations which accura...
research
02/18/2021

Data-driven formulation of natural laws by recursive-LASSO-based symbolic regression

Discovery of new natural laws has for a long time relied on the inspirat...
research
12/01/2020

Probabilistic Grammars for Equation Discovery

Equation discovery, also known as symbolic regression, is a type of auto...
research
10/26/2020

Unsupervised discovery of interpretable hyperelastic constitutive laws

We propose a new approach for data-driven automated discovery of hyperel...
research
05/01/2021

Data-driven discovery of physical laws with human-understandable deep learning

There is an opportunity for deep learning to revolutionize science and t...
research
06/21/2023

Learning Homogenization for Elliptic Operators

Multiscale partial differential equations (PDEs) arise in various applic...
research
02/02/2022

AI Research Associate for Early-Stage Scientific Discovery

Artificial intelligence (AI) has been increasingly applied in scientific...
