Rethinking ResNets: Improved Stacking Strategies With High Order Schemes

03/28/2021
by   Zhengbo Luo, et al.

Deep neural network (DNN) architectures hold many of the leading records in computer vision. Although these architectures draw worldwide attention, the design of their overall structure still lacks general guidance. Building on the relationship between DNN design and numerical differential equations, which several researchers have observed in recent years, we perform a fair comparison of residual designs from a higher-order perspective. We show that the widely used DNN design strategy of repeatedly stacking a small block can be easily improved, with support from solid theory and without extra parameters. We reorganize the residual design in higher-order ways, inspired by the observation that many effective networks can be interpreted as different numerical discretizations of differential equations. The design of ResNet follows a relatively simple scheme, the forward Euler method; however, the situation quickly becomes complicated as blocks are stacked. We hypothesize that if a stacked ResNet is in some sense equivalent to a higher-order scheme, then the current way of forward propagation may be relatively weak compared with a typical high-order method such as Runge-Kutta. We propose higher-order ResNets to test this hypothesis on widely used computer vision benchmarks with extensive experiments. We observe stable and noticeable gains in performance, along with benefits to convergence and robustness.
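The contrast between a forward Euler update and a higher-order Runge-Kutta update can be illustrated with a minimal sketch. This is not the authors' architecture: the function `f` below is a hypothetical stand-in for a residual branch (here simply the right-hand side of the scalar ODE dx/dt = -x), and the step size `h`, the names `euler_step`, `rk2_step`, and `integrate` are assumptions chosen for illustration. The second-order scheme shown is the midpoint method, one common member of the Runge-Kutta family, and it reuses the same `f` rather than introducing a new one.

```python
import math


def f(x):
    # Hypothetical stand-in for a residual branch: right-hand side of dx/dt = -x.
    return -x


def euler_step(x, h):
    # Forward Euler, the form of a plain residual update: x_next = x + h * f(x).
    return x + h * f(x)


def rk2_step(x, h):
    # Midpoint method (second-order Runge-Kutta), reusing the same f twice.
    k1 = f(x)
    k2 = f(x + 0.5 * h * k1)
    return x + h * k2


def integrate(step, x0, h, n_steps):
    # "Stacking": apply the same update rule n_steps times in sequence.
    x = x0
    for _ in range(n_steps):
        x = step(x, h)
    return x


x0, h, n_steps = 1.0, 0.1, 10
exact = x0 * math.exp(-h * n_steps)  # closed-form solution at t = h * n_steps

euler_error = abs(integrate(euler_step, x0, h, n_steps) - exact)
rk2_error = abs(integrate(rk2_step, x0, h, n_steps) - exact)

print(f"Forward Euler error: {euler_error:.6f}")
print(f"Midpoint RK2 error:  {rk2_error:.6f}")
```

On this toy problem the midpoint update tracks the exact solution more closely than forward Euler at the same step size, at the cost of evaluating `f` twice per step. How this carries over to trained networks is exactly what the experiments in the paper are meant to test.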


