Predict Globally, Correct Locally: Parallel-in-Time Optimal Control of Neural Networks

02/07/2019
by Panos Parpas, et al.

The links between optimal control of dynamical systems and neural networks have proved beneficial from both a theoretical and a practical point of view. Several researchers have exploited these links to investigate the stability of different neural network architectures and to develop memory-efficient training algorithms. We also adopt the dynamical systems view of neural networks, but our aim differs from that of earlier works: we exploit the links between dynamical systems, optimal control, and neural networks to develop a novel distributed optimization algorithm. The proposed algorithm addresses the most significant obstacle for distributed algorithms for neural network optimization: the network weights cannot be updated until the forward propagation of the data and the backward propagation of the gradients are complete. Using the dynamical systems point of view, we interpret the layers of a (residual) neural network as the discretized dynamics of a dynamical system and exploit the relationship between the co-states (adjoints) of the optimal control problem and backpropagation. We then develop a parallel-in-time method that updates the parameters of the network without waiting for the forward or backward propagation to complete in full. We establish the convergence of the proposed algorithm. Preliminary numerical results suggest that the algorithm is competitive with, and more efficient than, the state of the art.
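
The correspondence between residual layers, discretized dynamics, and co-states can be illustrated with a small sketch. It is not the paper's parallel-in-time algorithm. It only shows, under assumed choices, that a residual network can be read as a forward Euler discretization and that the backward co-state (adjoint) recursion yields the same gradients as backpropagation. The tanh layer, the step size `h`, the squared-error terminal loss, and all function names are assumptions made for illustration.

```python
import numpy as np

def forward(x0, weights, h):
    """Forward Euler view of a residual network (assumed layer form):
    x_{k+1} = x_k + h * tanh(W_k x_k). Returns all states x_0..x_N."""
    states = [x0]
    for W in weights:
        x = states[-1]
        states.append(x + h * np.tanh(W @ x))
    return states

def loss(x_final, target):
    """Assumed terminal cost: half the squared error."""
    return 0.5 * np.sum((x_final - target) ** 2)

def adjoint_gradients(states, weights, target, h):
    """Backward co-state recursion. The co-state p_k equals dLoss/dx_k,
    so this loop is backpropagation written in optimal-control notation."""
    p = states[-1] - target          # terminal co-state p_N
    grads = [None] * len(weights)
    for k in reversed(range(len(weights))):
        x, W = states[k], weights[k]
        s = 1.0 - np.tanh(W @ x) ** 2        # derivative of tanh at W x
        grads[k] = h * np.outer(p * s, x)    # dLoss/dW_k
        p = p + h * W.T @ (p * s)            # p_k from p_{k+1}
    return grads

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    weights = [rng.standard_normal((3, 3)) for _ in range(4)]
    x0, target = rng.standard_normal(3), rng.standard_normal(3)
    states = forward(x0, weights, h=0.1)
    grads = adjoint_gradients(states, weights, target, h=0.1)
    print("loss:", loss(states[-1], target))
    print("gradient shapes:", [g.shape for g in grads])
```

Note that both loops are sequential in the layer index `k`: the backward recursion cannot start until the forward pass has produced every state. This is the dependency that the abstract identifies as the main obstacle to distributed training.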

research
06/17/2020

NNC: Neural-Network Control of Dynamical Systems on Graphs

We study the ability of neural networks to steer or control trajectories...
research
07/27/2018

On the overfly algorithm in deep learning of neural networks

In this paper we investigate the supervised backpropagation training of ...
research
10/27/2017

Multi-level Residual Networks from Dynamical Systems View

Deep residual networks (ResNets) and their variants are widely used in m...
research
12/16/2020

Physical deep learning based on optimal control of dynamical systems

A central topic in recent artificial intelligence technologies is deep l...
research
03/06/2021

Artificial neural network as a universal model of nonlinear dynamical systems

We suggest a universal map capable to recover a behavior of a wide range...
research
07/17/2020

A Differential Game Theoretic Neural Optimizer for Training Residual Networks

Connections between Deep Neural Networks (DNNs) training and optimal con...
research
11/26/2020

Spectral Analysis and Stability of Deep Neural Dynamics

Our modern history of deep learning follows the arc of famous emergent d...
