Weighted Residuals for Very Deep Networks

05/28/2016
by Falong Shen, et al.

Deep residual networks have recently shown appealing performance on many challenging computer vision tasks. However, the original residual structure still has defects that make very deep networks difficult to converge. In this paper, we introduce a weighted residual network that addresses two problems: the incompatibility between ReLU and element-wise addition, and the initialization of deep networks. The weighted residual network learns to combine residuals from different layers effectively and efficiently. The proposed models show consistent improvements in accuracy and convergence as depth increases from 100+ layers to 1000+ layers. In addition, the weighted residual networks add little computation and GPU memory overhead compared with the original residual networks. The networks are optimized by projected stochastic gradient descent. Experiments on CIFAR-10 show that our algorithm converges faster than the original residual networks and reaches a high accuracy of 95.3% with a 1192-layer model.
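
The abstract does not spell out the block structure, so the following is only a minimal sketch of one plausible reading, not the authors' implementation. It assumes each block computes x + lam * F(x) with a learnable scalar weight lam, that the residual branch F ends in a linear map rather than a ReLU (so the added term can be negative as well as positive), and that the projection step clips lam to the interval [-1, 1]. The function names, the dense two-layer branch, and the interval are all illustrative assumptions.

```python
import numpy as np


def residual_branch(x, w1, w2):
    # Hypothetical branch F(x): linear -> ReLU -> linear.
    # Ending on a linear map keeps the branch output signed.
    return np.maximum(x @ w1, 0.0) @ w2


def weighted_residual_block(x, w1, w2, lam):
    # Assumed form of a weighted residual block: identity path plus a
    # scalar-weighted residual. With lam = 0 the block is the identity.
    return x + lam * residual_branch(x, w1, w2)


def projected_sgd_step(lam, grad, lr=0.1, low=-1.0, high=1.0):
    # Plain SGD update followed by projection onto [low, high].
    # The interval [-1, 1] is an assumption made for this sketch.
    return float(np.clip(lam - lr * grad, low, high))


rng = np.random.default_rng(0)
x = rng.normal(size=(4, 8))
w1 = rng.normal(size=(8, 8)) * 0.1
w2 = rng.normal(size=(8, 8)) * 0.1

# A zero-initialized weight makes every block start as the identity map.
y0 = weighted_residual_block(x, w1, w2, lam=0.0)

# An update that would push lam past the upper bound is projected back.
lam_next = projected_sgd_step(0.95, grad=-2.0)
```

Under these assumptions, initializing every lam at zero makes a stack of such blocks start as the identity regardless of depth, which is one way a scalar weight could ease the initialization problem the abstract mentions.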

research
11/02/2019

Global Convergence of Gradient Descent for Deep Linear Residual Networks

We analyze the global convergence of gradient descent for deep linear re...
research
03/22/2018

Residual Networks: Lyapunov Stability and Convex Decomposition

While training error of most deep neural networks degrades as the depth ...
research
06/10/2019

Network Implosion: Effective Model Compression for ResNets via Static Layer Pruning and Retraining

Residual Networks with convolutional layers are widely used in the field...
research
08/09/2016

Residual Networks of Residual Networks: Multilevel Residual Networks

A residual-networks family with hundreds or even thousands of layers dom...
research
01/27/2023

Deep Residual Compensation Convolutional Network without Backpropagation

PCANet and its variants provided good accuracy results for classificatio...
research
12/31/2018

Deep Residual Learning in the JPEG Transform Domain

We introduce a general method of performing Residual Network inference a...
research
12/29/2018

Autoencoder Based Residual Deep Networks for Robust Regression Prediction and Spatiotemporal Estimation

To have a superior generalization, a deep learning neural network often ...