Generalizing the Convolution Operator in Convolutional Neural Networks

07/14/2017
by Kamaledin Ghiasi-Shirazi, et al.

Convolutional neural networks (CNNs) have become a primary tool for solving many machine vision and machine learning problems. A major element of these networks is the convolution operator, which essentially computes the inner product between a weight vector and the vectorized image patches extracted by sliding a window over the image planes of the previous layer. In this paper, we propose two classes of surrogate functions for the inner product operation inherent in the convolution operator, and so obtain two generalizations of the convolution operator. The first is the class of positive definite kernel functions, whose use is justified by the kernel trick. The second is the class of similarity measures defined in terms of a distance function. We justify the latter by tracing it back to the basic idea behind the neocognitron, the ancestor of CNNs. Both methods are then further generalized by allowing a monotonically increasing function to be applied to their output. Like any other trainable parameter in a neural network, the template pattern and the parameters of the kernel or distance function are trained with the back-propagation algorithm. As an aside, we use the proposed framework to justify the use of the sine activation function in CNNs. Our experiments on the MNIST dataset show that generalized CNNs based on weighted L1/L2 distances can match the performance of ordinary CNNs, which demonstrates the applicability of the proposed generalization of convolutional neural networks.
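
The idea of swapping the inner product for another similarity function can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names, the choice of a Gaussian kernel as the example positive definite kernel, the use of a negated weighted L1 distance as the similarity measure, and the stride-1, no-padding, single-channel setup are all assumptions made here for illustration. The subsequent monotonically increasing function and the training of the parameters by back-propagation are omitted.

```python
import numpy as np

def extract_patches(image, k):
    """Slide a k x k window over a 2-D image (stride 1, no padding) and
    return the vectorized patches with shape (out_h, out_w, k * k)."""
    h, w = image.shape
    out_h, out_w = h - k + 1, w - k + 1
    patches = np.empty((out_h, out_w, k * k))
    for i in range(out_h):
        for j in range(out_w):
            patches[i, j] = image[i:i + k, j:j + k].ravel()
    return patches

def inner_product(patches, template):
    # Ordinary convolution as used in CNNs: <patch, template>.
    return patches @ template

def gaussian_kernel(patches, template, gamma=0.5):
    # Assumed example of a positive definite kernel:
    # exp(-gamma * ||patch - template||^2).
    sq_dist = np.sum((patches - template) ** 2, axis=-1)
    return np.exp(-gamma * sq_dist)

def neg_weighted_l1(patches, template, weights):
    # Assumed distance-based similarity: the negated weighted L1 distance,
    # so a smaller distance gives a larger response.
    return -np.sum(weights * np.abs(patches - template), axis=-1)

def generalized_conv(image, template, similarity, **params):
    """Apply `similarity` between a k x k template and every image patch."""
    k = template.shape[0]
    patches = extract_patches(image, k)
    return similarity(patches, template.ravel(), **params)

# Usage: the template is copied from the image at position (1, 1), so the
# kernel and distance-based responses both peak there.
image = np.arange(16, dtype=float).reshape(4, 4)
template = image[1:3, 1:3].copy()

ordinary = generalized_conv(image, template, inner_product)
kernelized = generalized_conv(image, template, gaussian_kernel, gamma=0.5)
distance_based = generalized_conv(image, template, neg_weighted_l1,
                                  weights=np.ones(4))
```

In this sketch only the `similarity` argument changes between the three variants; patch extraction and the template are shared, which mirrors the way the abstract describes the generalization as a replacement for the inner product alone.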

research
08/20/2021

PowerLinear Activation Functions with application to the first layer of CNNs

Convolutional neural networks (CNNs) have become the state-of-the-art to...
research
04/22/2018

Decoupled Networks

Inner product-based convolution has been a central component of convolut...
research
09/04/2018

Geometric Operator Convolutional Neural Network

The Convolutional Neural Network (CNN) has been successfully applied in ...
research
09/19/2017

Image operator learning coupled with CNN classification and its application to staff line removal

Many image transformations can be modeled by image operators that are ch...
research
10/28/2019

Neural Similarity Learning

Inner product-based convolution has been the founding stone of convoluti...
research
11/08/2017

Deep Hyperspherical Learning

Convolution as inner product has been the founding basis of convolutiona...
research
10/16/2017

Convolutional neural networks for structured omics: OmicsCNN and the OmicsConv layer

Convolutional Neural Networks (CNNs) are a popular deep learning archite...
