m-RevNet: Deep Reversible Neural Networks with Momentum

08/12/2021
by   Duo Li, et al.
0

In recent years, the connections between deep residual networks and first-order Ordinary Differential Equations (ODEs) have been disclosed. In this work, we further bridge the deep neural architecture design with the second-order ODEs and propose a novel reversible neural network, termed as m-RevNet, that is characterized by inserting momentum update to residual blocks. The reversible property allows us to perform backward pass without access to activation values of the forward pass, greatly relieving the storage burden during training. Furthermore, the theoretical foundation based on second-order ODEs grants m-RevNet with stronger representational power than vanilla residual networks, which potentially explains its performance gains. For certain learning scenarios, we analytically and empirically reveal that our m-RevNet succeeds while standard ResNet fails. Comprehensive experiments on various image classification and semantic segmentation benchmarks demonstrate the superiority of our m-RevNet over ResNet, concerning both memory efficiency and recognition performance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/15/2021

Momentum Residual Neural Networks

The training of deep residual neural networks (ResNets) with backpropaga...
research
05/24/2019

Robust learning with implicit residual networks

In this effort we propose a new deep architecture utilizing residual blo...
research
11/02/2018

Invertible Residual Networks

Reversible deep networks provide useful theoretical guarantees and have ...
research
01/26/2022

Momentum Capsule Networks

Capsule networks are a class of neural networks that achieved promising ...
research
09/12/2017

Reversible Architectures for Arbitrarily Deep Residual Neural Networks

Recently, deep residual networks have been successfully applied in many ...
research
09/18/2022

Semantic Segmentation using Neural Ordinary Differential Equations

The idea of neural Ordinary Differential Equations (ODE) is to approxima...
research
03/28/2021

Rethinking ResNets: Improved Stacking Strategies With High Order Schemes

Various Deep Neural Network architectures are keeping massive vital reco...

Please sign up or login with your details

Forgot password? Click here to reset