Heavy Ball Neural Ordinary Differential Equations

10/10/2021
by   Hedi Xia, et al.
15

We propose heavy ball neural ordinary differential equations (HBNODEs), leveraging the continuous limit of the classical momentum accelerated gradient descent, to improve neural ODEs (NODEs) training and inference. HBNODEs have two properties that imply practical advantages over NODEs: (i) The adjoint state of an HBNODE also satisfies an HBNODE, accelerating both forward and backward ODE solvers, thus significantly reducing the number of function evaluations (NFEs) and improving the utility of the trained models. (ii) The spectrum of HBNODEs is well structured, enabling effective learning of long-term dependencies from complex sequential data. We verify the advantages of HBNODEs over NODEs on benchmark tasks, including image classification, learning complex dynamics, and sequential modeling. Our method requires remarkably fewer forward and backward NFEs, is more accurate, and learns long-term dependencies more effectively than the other ODE-based neural network models. Code is available at <https://github.com/hedixia/HeavyBallNODE>.

READ FULL TEXT
research
07/13/2022

AdamNODEs: When Neural ODE Meets Adaptive Moment Estimation

Recent work by Xia et al. leveraged the continuous-limit of the classica...
research
12/20/2022

Learning Subgrid-scale Models with Neural Ordinary Differential Equations

We propose a new approach to learning the subgrid-scale model when simul...
research
02/24/2022

Learning POD of Complex Dynamics Using Heavy-ball Neural ODEs

Proper orthogonal decomposition (POD) allows reduced-order modeling of c...
research
10/13/2021

How Does Momentum Benefit Deep Neural Networks Architecture Design? A Few Case Studies

We present and review an algorithmic and theoretical framework for impro...
research
02/05/2022

LyaNet: A Lyapunov Framework for Training Neural ODEs

We propose a method for training ordinary differential equations by usin...
research
03/15/2021

Meta-Solver for Neural Ordinary Differential Equations

A conventional approach to train neural ordinary differential equations ...
research
06/03/2020

Adaptive Checkpoint Adjoint Method for Gradient Estimation in Neural ODE

Neural ordinary differential equations (NODEs) have recently attracted i...

Please sign up or login with your details

Forgot password? Click here to reset