Learning via nonlinear conjugate gradients and depth-varying neural ODEs

02/11/2022
by   George Baravdish, et al.
0

The inverse problem of supervised reconstruction of depth-variable (time-dependent) parameters in a neural ordinary differential equation (NODE) is considered, that means finding the weights of a residual network with time continuous layers. The NODE is treated as an isolated entity describing the full network as opposed to earlier research, which embedded it between pre- and post-appended layers trained by conventional methods. The proposed parameter reconstruction is done for a general first order differential equation by minimizing a cost functional covering a variety of loss functions and penalty terms. A nonlinear conjugate gradient method (NCG) is derived for the minimization. Mathematical properties are stated for the differential equation and the cost functional. The adjoint problem needed is derived together with a sensitivity problem. The sensitivity problem can estimate changes in the network output under perturbation of the trained parameters. To preserve smoothness during the iterations the Sobolev gradient is calculated and incorporated. As a proof-of-concept, numerical results are included for a NODE and two synthetic datasets, and compared with standard gradient approaches (not based on NODEs). The results show that the proposed method works well for deep learning with infinite numbers of layers, and has built-in stability and smoothness.

READ FULL TEXT

page 21

page 22

page 23

research
11/24/2021

Numerical solution of a nonlinear functional integro-differential equation

In this paper, we consider a boundary value problem (BVP) for a fourth o...
research
04/24/2022

Numerical methods that preserve a Lyapunov function for Ordinary Differential Equations

The paper studies numerical methods that preserve a Lyapunov function of...
research
06/17/2022

Learning the parameters of a differential equation from its trajectory via the adjoint equation

The paper contributes to strengthening the relation between machine lear...
research
06/01/2023

Wavefront reconstruction of discontinuous phase objects from optical deflectometry

One of the challenges in phase measuring deflectometry is to retrieve th...
research
05/27/2022

Standalone Neural ODEs with Sensitivity Analysis

This paper presents the Standalone Neural ODE (sNODE), a continuous-dept...
research
06/16/2022

Closed-Form Diffeomorphic Transformations for Time Series Alignment

Time series alignment methods call for highly expressive, differentiable...
research
10/26/2014

A Novel Statistical Method Based on Dynamic Models for Classification

Realizations of stochastic process are often observed temporal data or f...

Please sign up or login with your details

Forgot password? Click here to reset