Feedforward and Recurrent Neural Networks Backward Propagation and Hessian in Matrix Form

09/16/2017
by   Maxim Naumov, et al.
0

In this paper we focus on the linear algebra theory behind feedforward (FNN) and recurrent (RNN) neural networks. We review backward propagation, including backward propagation through time (BPTT). Also, we obtain a new exact expression for Hessian, which represents second order effects. We show that for t time steps the weight gradient can be expressed as a rank-t matrix, while the weight Hessian is as a sum of t^2 Kronecker products of rank-1 and W^TAW matrices, for some matrix A and weight matrix W. Also, we show that for a mini-batch of size r, the weight update can be expressed as a rank-rt matrix. Finally, we briefly comment on the eigenvalues of the Hessian matrix.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/24/2021

A Provably Componentwise Backward Stable O(n^2) QR Algorithm for the Diagonalization of Colleague Matrices

The roots of a monic polynomial expressed in a Chebyshev basis are known...
research
12/18/2017

Parallel Complexity of Forward and Backward Propagation

We show that the forward and backward propagation can be formulated as a...
research
07/07/2021

Efficient Matrix-Free Approximations of Second-Order Information, with Applications to Pruning and Optimization

Efficiently approximating local curvature information of the loss functi...
research
05/14/2019

Efficient Computation of Hessian Matrices in TensorFlow

The Hessian matrix has a number of important applications in a variety o...
research
01/16/2013

Training Neural Networks with Stochastic Hessian-Free Optimization

Hessian-free (HF) optimization has been successfully used for training d...
research
10/27/2003

Feedforward Neural Networks with Diffused Nonlinear Weight Functions

In this paper, feedforward neural networks are presented that have nonli...
research
09/11/2015

Hessian-free Optimization for Learning Deep Multidimensional Recurrent Neural Networks

Multidimensional recurrent neural networks (MDRNNs) have shown a remarka...

Please sign up or login with your details

Forgot password? Click here to reset