4-Connected Shift Residual Networks

10/22/2019
by   Pascal Mettes, et al.
0

The shift operation was recently introduced as an alternative to spatial convolutions. The operation moves subsets of activations horizontally and/or vertically. Spatial convolutions are then replaced with shift operations followed by point-wise convolutions, significantly reducing computational costs. In this work, we investigate how shifts should best be applied to high accuracy CNNs. We apply shifts of two different neighbourhood groups to ResNet on ImageNet: the originally introduced 8-connected (8C) neighbourhood shift and the less well studied 4-connected (4C) neighbourhood shift. We find that when replacing ResNet's spatial convolutions with shifts, both shift neighbourhoods give equal ImageNet accuracy, showing the sufficiency of small neighbourhoods for large images. Interestingly, when incorporating shifts to all point-wise convolutions in residual networks, 4-connected shifts outperform 8-connected shifts. Such a 4-connected shift setup gives the same accuracy as full residual networks while reducing the number of parameters and FLOPs by over 40 highlight that without spatial convolutions, ResNet's downsampling/upsampling bottleneck channel structure is no longer needed. We show a new, 4C shift-based residual network, much shorter than the original ResNet yet with a higher accuracy for the same computational cost. This network is the highest accuracy shift-based network yet shown, demonstrating the potential of shifting in deep neural networks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/22/2017

Shift: A Zero FLOP, Zero Parameter Alternative to Spatial Convolutions

Neural networks rely on convolutions to aggregate spatial information. H...
research
09/05/2019

Powerset Convolutional Neural Networks

We present a novel class of convolutional neural networks (CNNs) for set...
research
09/22/2018

Shift-based Primitives for Efficient Convolutional Neural Networks

We propose a collection of three shift-based primitives for building eff...
research
01/11/2023

Deep Residual Axial Networks

While residual networks (ResNets) demonstrate outstanding performance on...
research
03/13/2019

All You Need is a Few Shifts: Designing Efficient Convolutional Neural Networks for Image Classification

Shift operation is an efficient alternative over depthwise separable con...
research
11/28/2017

Learning Spatio-Temporal Representation with Pseudo-3D Residual Networks

Convolutional Neural Networks (CNN) have been regarded as a powerful cla...
research
06/27/2023

Easing Color Shifts in Score-Based Diffusion Models

Generated images of score-based models can suffer from errors in their s...

Please sign up or login with your details

Forgot password? Click here to reset