Speeding-Up Back-Propagation in DNN: Approximate Outer Product with Memory

10/18/2021
by   Eduin E. Hernandez, et al.

This paper considers an algorithm for approximately evaluating back-propagation in DNN training, which we term Approximate Outer Product Gradient Descent with Memory (Mem-AOP-GD). Mem-AOP-GD approximates stochastic gradient descent by using only a subset of the outer products that make up the matrix multiplications in back-propagation. To correct for the bias inherent in this approximation, the algorithm retains in memory an accumulation of the outer products that are not used. We investigate the performance of the proposed algorithm in terms of DNN training loss under two design parameters: (i) the number of outer products used for the approximation, and (ii) the policy used to select them. We show experimentally that Mem-AOP-GD can yield significant improvements in both computational complexity and accuracy.
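The idea in the abstract can be illustrated with a minimal, dependency-free sketch. It relies on the fact that a weight-gradient product of the form X^T G equals the sum of the per-sample outer products x_i g_i^T. Everything specific below is an assumption made for illustration and is not taken from the paper: the function names (`mem_aop_step`, `zeros`, `outer`), the norm-based top-k selection policy, and the choice to keep the memory as a single accumulated matrix that is added back in full at the next step.

```python
def zeros(rows, cols):
    """Return a rows x cols matrix of zeros as nested lists."""
    return [[0 for _ in range(cols)] for _ in range(rows)]


def outer(u, v):
    """Outer product u v^T of two vectors."""
    return [[ui * vj for vj in v] for ui in u]


def add_into(acc, m):
    """Add matrix m into acc in place."""
    for i, row in enumerate(m):
        for j, val in enumerate(row):
            acc[i][j] += val


def norm(v):
    """Euclidean norm of a vector."""
    return sum(x * x for x in v) ** 0.5


def mem_aop_step(X, G, memory, k):
    """Approximate X^T G with k outer products plus the carried-over memory.

    X: list of input rows x_i; G: list of gradient rows g_i (same length).
    memory: accumulated outer products skipped in earlier steps.
    Returns (approx, new_memory); the input memory is not modified.
    """
    # Hypothetical selection policy: keep the k pairs with the largest
    # product of norms |x_i| * |g_i|.
    scores = [norm(x) * norm(g) for x, g in zip(X, G)]
    order = sorted(range(len(X)), key=lambda i: scores[i], reverse=True)
    chosen = set(order[:k])

    approx = [row[:] for row in memory]          # re-inject skipped terms
    new_memory = zeros(len(memory), len(memory[0]))
    for i, (x, g) in enumerate(zip(X, G)):
        target = approx if i in chosen else new_memory
        add_into(target, outer(x, g))            # used now, or remembered
    return approx, new_memory


if __name__ == "__main__":
    X = [[1, 0], [0, 2], [3, 1]]   # three samples, two input features
    G = [[1, 1], [2, 0], [0, 1]]   # matching back-propagated gradients
    approx, memory = mem_aop_step(X, G, zeros(2, 2), k=1)
    print("approximate gradient:", approx)
    print("memory of skipped terms:", memory)
```

Under these assumptions, the approximate gradients summed over all steps, plus the final memory, equal the exact products summed over the same steps, so nothing that is skipped is permanently lost. The parameter `k` and the scoring rule correspond to the two design parameters named in the abstract: the number of outer products and the selection policy.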

