Parallel Complexity of Forward and Backward Propagation

12/18/2017
by   Maxim Naumov, et al.
0

We show that the forward and backward propagation can be formulated as a solution of lower and upper triangular systems of equations. For standard feedforward (FNNs) and recurrent neural networks (RNNs) the triangular systems are always block bi-diagonal, while for a general computation graph (directed acyclic graph) they can have a more complex triangular sparsity pattern. We discuss direct and iterative parallel algorithms that can be used for their solution and interpreted as different ways of performing model parallelism. Also, we show that for FNNs and RNNs with k layers and τ time steps the backward propagation can be performed in parallel in O( k) and O( k τ) steps, respectively. Finally, we outline the generalization of this technique using Jacobians that potentially allows us to handle arbitrary layers.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/16/2017

Feedforward and Recurrent Neural Networks Backward Propagation and Hessian in Matrix Form

In this paper we focus on the linear algebra theory behind feedforward (...
research
11/05/2019

PARAOPT: A parareal algorithm for optimality systems

The time parallel solution of optimality systems arising in PDE constrai...
research
03/19/2018

Projective Splitting with Forward Steps: Asynchronous and Block-Iterative Operator Splitting

This work is concerned with the classical problem of finding a zero of a...
research
04/01/2021

Optimizer Fusion: Efficient Training with Better Locality and Parallelism

Machine learning frameworks adopt iterative optimizers to train neural n...
research
02/15/2018

Event Nugget Detection with Forward-Backward Recurrent Neural Networks

Traditional event detection methods heavily rely on manually engineered ...
research
11/22/2019

SparseTrain:Leveraging Dynamic Sparsity in Training DNNs on General-Purpose SIMD Processors

Our community has greatly improved the efficiency of deep learning appli...
research
11/09/2022

Accelerating Adversarial Perturbation by 50 Propagation

Adversarial perturbation plays a significant role in the field of advers...

Please sign up or login with your details

Forgot password? Click here to reset