Frequency Pooling: Shift-Equivalent and Anti-Aliasing Downsampling

09/24/2021
by   Zhendong Zhang, et al.
0

Convolution utilizes a shift-equivalent prior of images, thus leading to great success in image processing tasks. However, commonly used poolings in convolutional neural networks (CNNs), such as max-pooling, average-pooling, and strided-convolution, are not shift-equivalent. Thus, the shift-equivalence of CNNs is destroyed when convolutions and poolings are stacked. Moreover, anti-aliasing is another essential property of poolings from the perspective of signal processing. However, recent poolings are neither shift-equivalent nor anti-aliasing. To address this issue, we propose a new pooling method that is shift-equivalent and anti-aliasing, named frequency pooling. Frequency pooling first transforms the features into the frequency domain, and then removes the frequency components beyond the Nyquist frequency. Finally, it transforms the features back to the spatial domain. We prove that frequency pooling is shift-equivalent and anti-aliasing based on the property of Fourier transform and Nyquist frequency. Experiments on image classification show that frequency pooling improves accuracy and robustness with respect to the shifts of CNNs.

READ FULL TEXT
research
04/25/2019

Making Convolutional Networks Shift-Invariant Again

Modern convolutional networks are not shift-invariant, as small input sh...
research
02/15/2021

How Convolutional Neural Networks Deal with Aliasing

The convolutional neural network (CNN) remains an essential tool in solv...
research
12/24/2020

On the Conditions of Absorption Property for Morphological Opening and Closing

This paper aims to establish the theoretical foundation for shift inclus...
research
06/28/2020

Frequency learning for image classification

Machine learning applied to computer vision and signal processing is ach...
research
05/09/2021

Truly shift-equivariant convolutional neural networks with adaptive polyphase upsampling

Convolutional neural networks lack shift equivariance due to the presenc...
research
01/28/2020

Discrete Signal Processing with Set Functions

Set functions are functions (or signals) indexed by the power set (set o...
research
09/19/2022

On the Shift Invariance of Max Pooling Feature Maps in Convolutional Neural Networks

In this paper, we aim to improve the mathematical interpretability of co...

Please sign up or login with your details

Forgot password? Click here to reset