Integrating Circle Kernels into Convolutional Neural Networks

07/06/2021
by   Kun He, et al.
11

The square kernel is a standard unit for contemporary Convolutional Neural Networks (CNNs), as it fits well on the tensor computation for the convolution operation. However, the receptive field in the human visual system is actually isotropic like a circle. Motivated by this observation, we propose using circle kernels with isotropic receptive fields for the convolution, and our training takes approximately equivalent amount of calculation when compared with the corresponding CNN with square kernels. Our preliminary experiments demonstrate the rationality of circle kernels. We then propose a kernel boosting strategy that integrates the circle kernels with square kernels for the training and inference, and we further let the kernel size/radius be learnable during the training. Note that we reparameterize the circle kernels or integrated kernels before the inference, thus taking no extra computation as well as the number of parameter overhead for the testing. Extensive experiments on several standard datasets, ImageNet, CIFAR-10 and CIFAR-100, using the circle kernels or integrated kernels on typical existing CNNs, show that our approach exhibits highly competitive performance. Specifically, on ImageNet with standard data augmentation, our approach dramatically boosts the performance of MobileNetV3-Small by 5.20 the performance of MobileNetV3-Large by 2.16 accuracy.

READ FULL TEXT
research
11/23/2022

Rega-Net:Retina Gabor Attention for Deep Convolutional Neural Networks

Extensive research works demonstrate that the attention mechanism in con...
research
02/09/2023

Gaussian Mask Convolution for Convolutional Neural Networks

Square convolution is a default unit in convolutional neural networks as...
research
06/17/2014

Replicating Kernels with a Short Stride Allows Sparse Reconstructions with Fewer Independent Kernels

In sparse coding it is common to tile an image into nonoverlapping patch...
research
05/28/2019

Network Deconvolution

Convolution is a central operation in Convolutional Neural Networks (CNN...
research
08/11/2019

ACNet: Strengthening the Kernel Skeletons for Powerful CNN via Asymmetric Convolution Blocks

As designing appropriate Convolutional Neural Network (CNN) architecture...
research
10/15/2021

FlexConv: Continuous Kernel Convolutions with Differentiable Kernel Sizes

When designing Convolutional Neural Networks (CNNs), one must select the...
research
10/02/2019

A Pre-defined Sparse Kernel Based Convolutionfor Deep CNNs

The high demand for computational and storage resources severely impede ...

Please sign up or login with your details

Forgot password? Click here to reset