Neural Differential Equations for Learning to Program Neural Nets Through Continuous Learning Rules

06/03/2022
by Kazuki Irie, et al.

Neural ordinary differential equations (ODEs) have attracted much attention as continuous-time counterparts of deep residual neural networks (NNs), and numerous extensions for recurrent NNs have been proposed. Since the 1980s, ODEs have also been used to derive theoretical results for NN learning rules, e.g., the famous connection between Oja's rule and principal component analysis. Such rules are typically expressed as additive iterative update processes which have straightforward ODE counterparts. Here we introduce a novel combination of learning rules and Neural ODEs to build continuous-time sequence processing nets that learn to manipulate short-term memory in rapidly changing synaptic connections of other nets. This yields continuous-time counterparts of Fast Weight Programmers and linear Transformers. Our novel models outperform the best existing Neural Controlled Differential Equation based models on various time series classification tasks, while also addressing their scalability limitations. Our code is public.
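The link between an additive learning rule and its ODE counterpart can be illustrated with Oja's rule. The following is a minimal sketch, not the models from the paper: it assumes the standard averaged form of Oja's rule, dw/dt = C w - (w^T C w) w, where C is an input covariance matrix, and integrates it with plain Euler steps, which is exactly an additive iterative update. The matrix `C`, the step size, and the function names `oja_flow` and `euler_integrate` are hypothetical choices made for illustration.

```python
import math

def oja_flow(w, C):
    """Right-hand side of the averaged Oja ODE: dw/dt = C w - (w^T C w) w."""
    n = len(w)
    Cw = [sum(C[i][j] * w[j] for j in range(n)) for i in range(n)]
    wCw = sum(w[i] * Cw[i] for i in range(n))
    return [Cw[i] - wCw * w[i] for i in range(n)]

def euler_integrate(w0, C, step=0.01, n_steps=5000):
    """Euler discretization: each step is an additive update w <- w + step * dw."""
    w = list(w0)
    for _ in range(n_steps):
        dw = oja_flow(w, C)
        w = [wi + step * dwi for wi, dwi in zip(w, dw)]
    return w

# Assumed toy covariance matrix (symmetric, positive definite).
C = [[3.0, 1.0], [1.0, 2.0]]
w = euler_integrate([1.0, 0.0], C)

# The weight vector should approach a unit-norm leading eigenvector of C,
# i.e. the first principal direction.
norm = math.sqrt(sum(wi * wi for wi in w))
```

Under these assumptions, `w` settles on a unit-length vector aligned with the leading eigenvector of `C`, which is the connection to principal component analysis mentioned above. Shrinking `step` while increasing `n_steps` moves the discrete updates closer to the continuous-time trajectory.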

research
12/02/2019

Differential Bayesian Neural Nets

Neural Ordinary Differential Equations (N-ODEs) are a powerful building ...
research
05/11/2023

Generalization bounds for neural ordinary differential equations and deep residual networks

Neural ordinary differential equations (neural ODEs) are a popular famil...
research
04/06/2023

Unconstrained Parametrization of Dissipative and Contracting Neural Ordinary Differential Equations

In this work, we introduce and study a class of Deep Neural Networks (DN...
research
10/28/2022

Toward Equation of Motion for Deep Neural Networks: Continuous-time Gradient Descent and Discretization Error Analysis

We derive and solve an “Equation of Motion” (EoM) for deep neural networ...
research
05/25/2019

Application and Computation of Probabilistic Neural Plasticity

The discovery of neural plasticity has proved that throughout the life o...
research
11/18/2019

Graph Neural Ordinary Differential Equations

We extend the framework of graph neural networks (GNN) to continuous tim...
research
05/05/2020

Time Dependence in Non-Autonomous Neural ODEs

Neural Ordinary Differential Equations (ODEs) are elegant reinterpretati...
