Improved Residual Networks for Image and Video Recognition

04/10/2020
by   Ionut Cosmin Duta, et al.
4

Residual networks (ResNets) represent a powerful type of convolutional neural network (CNN) architecture, widely adopted and used in various tasks. In this work we propose an improved version of ResNets. Our proposed improvements address all three main components of a ResNet: the flow of information through the network layers, the residual building block, and the projection shortcut. We are able to show consistent improvements in accuracy and learning convergence over the baseline. For instance, on ImageNet dataset, using the ResNet with 50 layers, for top-1 accuracy we can report a 1.19 over the baseline in one setting and around 2 these improvements are obtained without increasing the model complexity. Our proposed approach allows us to train extremely deep networks, while the baseline shows severe optimization issues. We report results on three tasks over six datasets: image classification (ImageNet, CIFAR-10 and CIFAR-100), object detection (COCO) and video action recognition (Kinetics-400 and Something-Something-v2). In the deep learning era, we establish a new milestone for the depth of a CNN. We successfully train a 404-layer deep CNN on the ImageNet dataset and a 3002-layer network on CIFAR-10 and CIFAR-100, while the baseline is not able to converge at such extreme depths. Code is available at: https://github.com/iduta/iresnet

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/23/2016

Wide Residual Networks

Deep residual networks were shown to be able to scale up to thousands of...
research
04/05/2018

Learning Strict Identity Mappings in Deep Residual Networks

A family of super deep networks, referred to as residual networks or Res...
research
06/21/2023

Efficient ResNets: Residual Network Design

ResNets (or Residual Networks) are one of the most commonly used models ...
research
07/23/2022

SSBNet: Improving Visual Recognition Efficiency by Adaptive Sampling

Downsampling is widely adopted to achieve a good trade-off between accur...
research
12/10/2015

Deep Residual Learning for Image Recognition

Deeper neural networks are more difficult to train. We present a residua...
research
03/15/2023

DeepMIM: Deep Supervision for Masked Image Modeling

Deep supervision, which involves extra supervisions to the intermediate ...

Please sign up or login with your details

Forgot password? Click here to reset