Reversible Architectures for Arbitrarily Deep Residual Neural Networks

09/12/2017
by   Bo Chang, et al.
0

Recently, deep residual networks have been successfully applied in many computer vision and natural language processing tasks, pushing the state-of-the-art performance with deeper and wider architectures. In this work, we interpret deep residual networks as ordinary differential equations (ODEs), which have long been studied in mathematics and physics with rich theoretical and empirical success. From this interpretation, we develop a theoretical framework on stability and reversibility of deep neural networks, and derive three reversible neural network architectures that can go arbitrarily deep in theory. The reversibility property allows a memory-efficient implementation, which does not need to store the activations for most hidden layers. Together with the stability of our architectures, this enables training deeper networks using only modest computational resources. We provide both theoretical analyses and empirical results. Experimental results demonstrate the efficacy of our architectures against several strong baselines on CIFAR-10, CIFAR-100 and STL-10 with superior or on-par state-of-the-art performance. Furthermore, we show our architectures yield superior results when trained using fewer training data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/14/2017

The Reversible Residual Network: Backpropagation Without Storing Activations

Deep residual networks (ResNets) have significantly pushed forward the s...
research
11/02/2018

Invertible Residual Networks

Reversible deep networks provide useful theoretical guarantees and have ...
research
06/22/2022

Neural Networks as Paths through the Space of Representations

Deep neural networks implement a sequence of layer-by-layer operations t...
research
05/09/2017

Stable Architectures for Deep Neural Networks

Deep neural networks have become invaluable tools for supervised machine...
research
08/12/2021

m-RevNet: Deep Reversible Neural Networks with Momentum

In recent years, the connections between deep residual networks and firs...
research
02/15/2021

Momentum Residual Neural Networks

The training of deep residual neural networks (ResNets) with backpropaga...
research
04/16/2021

Polynomial Networks in Deep Classifiers

Deep neural networks have been the driving force behind the success in c...

Please sign up or login with your details

Forgot password? Click here to reset