Elastic Neural Networks for Classification

10/01/2018
by Yi Zhou, et al.

In this work, we propose a framework for improving the performance of any deep neural network that may suffer from vanishing gradients. To address the vanishing gradient issue, we insert an intermediate output branch after each layer in the computational graph and use the corresponding prediction loss to feed the gradient to the early layers. The framework, which we name Elastic network, is tested with several well-known networks on the CIFAR10 and CIFAR100 datasets, and the experimental results show that it improves accuracy on both shallow networks (e.g., MobileNet) and deep convolutional neural networks (e.g., DenseNet). We also identify the types of networks where the framework does not improve performance and discuss the reasons. Finally, as a side product, the computational complexity of the resulting networks can be adjusted in an elastic manner by selecting the output branch according to the current computational budget.
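The two ideas in the abstract, per-branch prediction losses and budget-based branch selection, can be illustrated with a minimal sketch. The abstract does not say how the branch losses are combined or how the budget is measured, so the weighted sum with equal default weights, the per-branch cost list, and the names `elastic_loss` and `select_branch` are all assumptions made for illustration, not the authors' implementation.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(logits, label):
    """Negative log-likelihood of the true class for one example."""
    return -math.log(softmax(logits)[label])


def elastic_loss(branch_logits, label, weights=None):
    """Combine the prediction losses of all output branches.

    branch_logits holds one logit vector per intermediate output branch,
    ordered from the earliest layer to the final output.
    Assumption: the losses are combined as a weighted sum, with equal
    weights by default. The abstract does not specify the combination.
    """
    if weights is None:
        weights = [1.0] * len(branch_logits)
    return sum(w * cross_entropy(z, label)
               for w, z in zip(weights, branch_logits))


def select_branch(branch_costs, budget):
    """Pick the deepest branch whose cost fits the budget.

    Assumption: branch_costs[i] is the cumulative cost (e.g. FLOPs) of
    running the network up to branch i, so costs do not decrease with depth.
    """
    affordable = [i for i, cost in enumerate(branch_costs) if cost <= budget]
    if not affordable:
        raise ValueError("budget is below the cost of the earliest branch")
    return max(affordable)
```

In an automatic-differentiation framework, minimizing such a combined loss lets each early layer receive gradient directly from its nearby branch rather than only through the full depth of the network. At inference time, `select_branch` would be called once per request with the available budget, and the forward pass would stop at the returned branch.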


research
07/02/2018

Elastic Neural Networks: A Scalable Framework for Embedded Computer Vision

We propose a new framework for image classification with deep neural net...
research
06/16/2020

Gradient Amplification: An efficient way to train deep neural networks

Improving performance of deep learning models and reducing their trainin...
research
05/04/2021

Multipath Graph Convolutional Neural Networks

Graph convolution networks have recently garnered a lot of attention for...
research
11/21/2019

Volume-preserving Neural Networks: A Solution to the Vanishing Gradient Problem

We propose a novel approach to addressing the vanishing (or exploding) g...
research
06/24/2022

SCAI: A Spectral data Classification framework with Adaptive Inference for the IoT platform

Currently, it is a hot research topic to realize accurate, efficient, an...
research
05/29/2023

Intelligent gradient amplification for deep neural networks

Deep learning models offer superior performance compared to other machin...
research
04/27/2020

Why should we add early exits to neural networks?

Deep neural networks are generally designed as a stack of differentiable...
