Formal derivation of Mesh Neural Networks with their Forward-Only gradient Propagation

05/16/2019
by Federico A. Galatolo, et al.

This paper proposes the Mesh Neural Network (MNN), a novel architecture that allows neurons to be connected in any topology in order to route information efficiently. In MNNs, information is propagated between neurons through a state transition function. State and error gradients are then computed directly from state updates, without backward computation. The MNN architecture and the error propagation schema are formalized and derived in tensor algebra. The proposed computational model can fully supply a gradient descent process and, owing to its expressivity and training efficiency, is suitable for very large scale NNs when compared with NNs based on back-propagation and computational graphs.
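To make the idea of computing gradients alongside state updates concrete, here is a minimal sketch in plain Python. It is not the paper's formulation: it assumes a dense weight matrix `W` (a zero entry standing for an absent connection), a `tanh` activation, and the state update `s <- tanh(W s)`, and it propagates the sensitivities `G[i][j][k] = d s[i] / d W[j][k]` forward with each update. The names `forward_with_sensitivities` and `loss_gradient` are hypothetical.

```python
import math


def forward_with_sensitivities(W, s0, steps):
    """Apply s <- tanh(W s) `steps` times while tracking
    G[i][j][k] = d s[i] / d W[j][k] in the same forward sweep.
    Assumed update rule and activation; illustrative only."""
    n = len(s0)
    s = list(s0)
    # The initial state does not depend on W, so all sensitivities start at 0.
    G = [[[0.0] * n for _ in range(n)] for _ in range(n)]
    for _ in range(steps):
        z = [sum(W[i][m] * s[m] for m in range(n)) for i in range(n)]
        new_s = [math.tanh(zi) for zi in z]
        new_G = [[[0.0] * n for _ in range(n)] for _ in range(n)]
        for i in range(n):
            fprime = 1.0 - new_s[i] ** 2  # derivative of tanh at z[i]
            for j in range(n):
                for k in range(n):
                    # Direct dependence of z[i] on W[j][k] (only when i == j).
                    direct = s[k] if i == j else 0.0
                    # Indirect dependence through the previous state.
                    indirect = sum(W[i][m] * G[m][j][k] for m in range(n))
                    new_G[i][j][k] = fprime * (direct + indirect)
        s, G = new_s, new_G
    return s, G


def loss_gradient(s, G, target, outputs):
    """Gradient w.r.t. W of 0.5 * sum((s[i] - t)^2) over the output neurons."""
    n = len(s)
    return [
        [sum((s[i] - t) * G[i][j][k] for i, t in zip(outputs, target))
         for k in range(n)]
        for j in range(n)
    ]


# Example: a 3-neuron mesh, 3 state updates, neuron 2 treated as the output.
W = [[0.1, -0.3, 0.2], [0.4, 0.0, -0.5], [-0.2, 0.6, 0.3]]
s0 = [0.5, -1.0, 0.25]
state, sens = forward_with_sensitivities(W, s0, steps=3)
grad = loss_gradient(state, sens, target=[0.0], outputs=[2])
```

The resulting `grad` can feed an ordinary gradient descent step on `W` without a separate backward pass. The cost of this sketch is the full sensitivity tensor, which has n^3 entries for n neurons, so it illustrates the principle and does not reflect the efficiency the paper claims.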


