A Novel Structured Natural Gradient Descent for Deep Learning

09/21/2021
by   Weihua Liu, et al.
0

Natural gradient descent (NGD) provided deep insights and powerful tools to deep neural networks. However the computation of Fisher information matrix becomes more and more difficult as the network structure turns large and complex. This paper proposes a new optimization method whose main idea is to accurately replace the natural gradient optimization by reconstructing the network. More specifically, we reconstruct the structure of the deep neural network, and optimize the new network using traditional gradient descent (GD). The reconstructed network achieves the effect of the optimization way with natural gradient descent. Experimental results show that our optimization method can accelerate the convergence of deep network models and achieve better performance than GD while sharing its computational simplicity.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/03/2018

Optimization Algorithm Inspired Deep Neural Network Structure Design

Deep neural networks have been one of the dominant machine learning appr...
research
01/12/2021

A SOM-based Gradient-Free Deep Learning Method with Convergence Analysis

As gradient descent method in deep learning causes a series of questions...
research
06/22/2023

Iteratively Preconditioned Gradient-Descent Approach for Moving Horizon Estimation Problems

Moving horizon estimation (MHE) is a widely studied state estimation app...
research
12/16/2014

Sparse, guided feature connections in an Abstract Deep Network

We present a technique for developing a network of re-used features, whe...
research
04/01/2019

Benchmarking Approximate Inference Methods for Neural Structured Prediction

Exact structured inference with neural network scoring functions is comp...
research
11/05/2018

PILAE: A Non-gradient Descent Learning Scheme for Deep Feedforward Neural Networks

In this work, a non-gradient descent learning scheme is proposed for dee...
research
12/12/2019

Adaptive Reticulum

Neural Networks and Random Forests: two popular techniques for supervise...

Please sign up or login with your details

Forgot password? Click here to reset