When are Neural ODE Solutions Proper ODEs?

07/30/2020
by   Katharina Ott, et al.
10

A key appeal of the recently proposed Neural Ordinary Differential Equation(ODE) framework is that it seems to provide a continuous-time extension of discrete residual neural networks. As we show herein, though, trained Neural ODE models actually depend on the specific numerical method used during training. If the trained model is supposed to be a flow generated from an ODE, it should be possible to choose another numerical solver with equal or smaller numerical error without loss of performance. We observe that if training relies on a solver with overly coarse discretization, then testing with another solver of equal or smaller numerical error results in a sharp drop in accuracy. In such cases, the combination of vector field and numerical method cannot be interpreted as a flow generated from an ODE, which arguably poses a fatal breakdown of the Neural ODE concept. We observe, however, that there exists a critical step size beyond which the training yields a valid ODE vector field. We propose a method that monitors the behavior of the ODE solver during training to adapt its step size, aiming to ensure a valid ODE without unnecessarily increasing computational cost. We verify this adaption algorithm on two common bench mark datasets as well as a synthetic dataset. Furthermore, we introduce a novel synthetic dataset in which the underlying ODE directly generates a classification task.

READ FULL TEXT
research
06/19/2018

Neural Ordinary Differential Equations

We introduce a new family of deep neural network models. Instead of spec...
research
05/09/2021

Opening the Blackbox: Accelerating Neural Differential Equations by Regularizing Internal Solver Heuristics

Democratization of machine learning requires architectures that automati...
research
09/03/2023

Implicit regularization of deep residual networks towards neural ODEs

Residual neural networks are state-of-the-art deep learning models. Thei...
research
07/03/2023

Understanding the impact of numerical solvers on inference for differential equation models

Most ordinary differential equation (ODE) models used to describe biolog...
research
09/20/2020

"Hey, that's not an ODE": Faster ODE Adjoints with 12 Lines of Code

Neural differential equations may be trained by backpropagating gradient...
research
11/06/2020

On the Ergodicity, Bias and Asymptotic Normality of Randomized Midpoint Sampling Method

The randomized midpoint method, proposed by [SL19], has emerged as an op...
research
05/10/2023

A synchronization-capturing multi-scale solver to the noisy integrate-and-fire neuron networks

The noisy leaky integrate-and-fire (NLIF) model describes the voltage co...

Please sign up or login with your details

Forgot password? Click here to reset