No More Strided Convolutions or Pooling: A New CNN Building Block for Low-Resolution Images and Small Objects

08/07/2022
by   Raja Sunkara, et al.
91

Convolutional neural networks (CNNs) have made resounding success in many computer vision tasks such as image classification and object detection. However, their performance degrades rapidly on tougher tasks where images are of low resolution or objects are small. In this paper, we point out that this roots in a defective yet common design in existing CNN architectures, namely the use of strided convolution and/or pooling layers, which results in a loss of fine-grained information and learning of less effective feature representations. To this end, we propose a new CNN building block called SPD-Conv in place of each strided convolution layer and each pooling layer (thus eliminates them altogether). SPD-Conv is comprised of a space-to-depth (SPD) layer followed by a non-strided convolution (Conv) layer, and can be applied in most if not all CNN architectures. We explain this new design under two most representative computer vision tasks: object detection and image classification. We then create new CNN architectures by applying SPD-Conv to YOLOv5 and ResNet, and empirically show that our approach significantly outperforms state-of-the-art deep learning models, especially on tougher tasks with low-resolution images and small objects. We have open-sourced our code at https://github.com/LabSAINT/SPD-Conv.

READ FULL TEXT

page 12

page 14

research
03/15/2017

Convolutional Low-Resolution Fine-Grained Classification

Successful fine-grained image classification methods learn subtle detail...
research
10/25/2016

Maxmin convolutional neural networks for image classification

Convolutional neural networks (CNN) are widely used in computer vision, ...
research
01/03/2021

An Evolution of CNN Object Classifiers on Low-Resolution Images

Object classification is a significant task in computer vision. It has b...
research
09/27/2022

CEC-CNN: A Consecutive Expansion-Contraction Convolutional Network for Very Small Resolution Medical Image Classification

Deep Convolutional Neural Networks (CNNs) for image classification succe...
research
09/28/2021

Turning old models fashion again: Recycling classical CNN networks using the Lattice Transformation

In the early 1990s, the first signs of life of the CNN era were given: L...
research
02/09/2021

Deep Multilabel CNN for Forensic Footwear Impression Descriptor Identification

In recent years deep neural networks have become the workhorse of comput...
research
11/27/2019

CSPNet: A New Backbone that can Enhance Learning Capability of CNN

Neural networks have enabled state-of-the-art approaches to achieve incr...

Please sign up or login with your details

Forgot password? Click here to reset