Imbedding Deep Neural Networks

01/31/2022
by   Andrew Corbett, et al.
3

Continuous depth neural networks, such as Neural ODEs, have refashioned the understanding of residual neural networks in terms of non-linear vector-valued optimal control problems. The common solution is to use the adjoint sensitivity method to replicate a forward-backward pass optimisation problem. We propose a new approach which explicates the network's `depth' as a fundamental variable, thus reducing the problem to a system of forward-facing initial value problems. This new method is based on the principle of `Invariant Imbedding' for which we prove a general solution, applicable to all non-linear, vector-valued optimal control problems with both running and terminal loss. Our new architectures provide a tangible tool for inspecting the theoretical–and to a great extent unexplained–properties of network depth. They also constitute a resource of discrete implementations of Neural ODEs comparable to classes of imbedded residual neural networks. Through a series of experiments, we show the competitive performance of the proposed architectures for supervised learning and time series prediction.

READ FULL TEXT

page 8

page 19

page 20

research
06/22/2020

Forward-Backward RRT: Branched Sampled FBSDEs for Stochastic Optimal Control

We propose a numerical method to solve forward-backward stochastic diffe...
research
12/11/2018

Layer-Parallel Training of Deep Residual Neural Networks

Residual neural networks (ResNets) are a promising class of deep neural ...
research
12/15/2019

Symplectic Runge-Kutta discretization of a regularized forward-backward sweep iteration for optimal control problems

Li, Chen, Tai E. (J. Machine Learning Research, 2018) have proposed ...
research
02/05/2023

On the numerical stability of discretised Optimal Control Problems

Optimal Control Problems consist on the optimisation of an objective fun...
research
11/24/2018

Forward Stability of ResNet and Its Variants

The residual neural network (ResNet) is a popular deep network architect...
research
02/04/2021

Continuous Random Variable Estimation is not Optimal for the Witsenhausen Counterexample

Optimal design of distributed decision policies can be a difficult task,...

Please sign up or login with your details

Forgot password? Click here to reset