A toolkit for data-driven discovery of governing equations in high-noise regimes

11/08/2021
by Charles B. Delahunt, et al.

We consider the data-driven discovery of governing equations from time-series data in the limit of high noise. The algorithms we develop form an extensive toolkit of methods for circumventing the deleterious effects of noise in the context of the sparse identification of nonlinear dynamics (SINDy) framework. We offer two primary contributions, both focused on noisy data acquired from a system x' = f(x). First, we propose, for use in high-noise settings, an extensive toolkit of critically enabling extensions for the SINDy regression method, which progressively cull functionals from an over-complete library and yield a set of sparse equations that regress to the derivative x'. These innovations can extract sparse governing equations and coefficients from high-noise time-series data (e.g. 300 [...] the correct sparse libraries in the Lorenz system, with median coefficient estimate errors equal to 1 [...] 23 [...] into a single method, but the individual modules can be tactically applied in other equation discovery methods (SINDy or not) to improve results on high-noise data. Second, we propose a technique, applicable to any model discovery method based on x' = f(x), to assess the accuracy of a discovered model in the context of non-unique solutions due to noisy data. Currently, this non-uniqueness can obscure a discovered model's accuracy and thus a discovery method's effectiveness. We describe a technique that uses linear dependencies among functionals to transform a discovered model into an equivalent form that is closest to the true model, enabling more accurate assessment of a discovered model's accuracy.
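
For readers unfamiliar with the culling step mentioned in the abstract, the sketch below shows the baseline idea only: sequentially thresholded least squares, the regression commonly associated with SINDy, applied to a small over-complete polynomial library. It is not the toolkit proposed in the paper. The toy system x' = x - x^2, the library, the noise level, the threshold, and the function names `build_library` and `stlsq` are all assumptions chosen for illustration, and the derivative is supplied directly with added noise rather than estimated from a time series.

```python
import numpy as np


def build_library(x):
    """Hypothetical candidate library for a 1-D state: [1, x, x^2, x^3]."""
    return np.column_stack([np.ones_like(x), x, x**2, x**3])


def stlsq(theta, dxdt, threshold=0.1, max_iter=10):
    """Sequentially thresholded least squares.

    Fit all functionals, zero out coefficients whose magnitude is below
    `threshold`, then refit on the surviving functionals, and repeat.
    """
    coef = np.linalg.lstsq(theta, dxdt, rcond=None)[0]
    for _ in range(max_iter):
        small = np.abs(coef) < threshold
        coef[small] = 0.0
        active = ~small
        if not active.any():
            break
        # Refit using only the functionals that survived the cull.
        coef[active] = np.linalg.lstsq(theta[:, active], dxdt, rcond=None)[0]
    return coef


# Toy data (assumed, not from the paper): x' = x - x^2 with noisy derivatives.
rng = np.random.default_rng(0)
x = np.linspace(-2.0, 2.0, 400)
dxdt = x - x**2 + rng.normal(scale=0.2, size=x.shape)

coef = stlsq(build_library(x), dxdt)
print(dict(zip(["1", "x", "x^2", "x^3"], coef)))
```

In this toy setting the constant and cubic terms should be culled, leaving coefficients near 1 and -1 on x and x^2. At the noise levels discussed in the abstract, plain thresholding of this kind is the part the paper's extensions aim to improve upon.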


research
05/05/2022

PyDaddy: A Python package for discovering stochastic dynamical equations from timeseries data

Most real-world ecological dynamics, ranging from ecosystem dynamics to ...
research
11/19/2022

Bayesian autoencoders for data-driven discovery of coordinates, governing equations and fundamental constants

Recent progress in autoencoder-based sparse identification of nonlinear ...
research
12/29/2022

Investigating Sindy As a Tool For Causal Discovery In Time Series Signals

The SINDy algorithm has been successfully used to identify the governing...
research
05/09/2020

Weak SINDy: Galerkin-Based Data-Driven Model Selection

We present a weak formulation and discretization of the system discovery...
research
05/09/2020

Weak SINDy: A Data-Driven Galerkin Method for System Identification

We present a weak formulation and discretization of the system discovery...
research
06/08/2023

Learning Closed-form Equations for Subgrid-scale Closures from High-fidelity Data: Promises and Challenges

There is growing interest in discovering interpretable, closed-form equa...
research
10/01/2021

Closed-form discovery of structural errors in models of chaotic systems by integrating Bayesian sparse regression and data assimilation

Models used for many important engineering and natural systems are imper...
