Designing Network Design Strategies Through Gradient Path Analysis

11/09/2022
by Chien-Yao Wang, et al.

Designing a high-efficiency, high-quality expressive network architecture has always been the most important research topic in the field of deep learning. Most of today's network design strategies focus on how to integrate features extracted from different layers and how to design computing units that extract these features effectively, thereby enhancing the expressiveness of the network. This paper proposes a new network design strategy: designing the network architecture based on gradient path analysis. On the whole, most of today's mainstream network design strategies are based on the feed-forward path; that is, the network architecture is designed based on the data path. In this paper, we aim to enhance the expressive ability of the trained model by improving the network's learning ability. Because the mechanism driving network parameter learning is the backward propagation algorithm, we design our strategies based on the back propagation path. We propose gradient path design strategies at the layer level, the stage level, and the network level, and show through theoretical analysis and experiments that these strategies are superior and feasible.

