On regularization for a convolutional kernel in neural networks

06/12/2019
by Peichang Guo, et al.

The convolutional neural network is a very important model in deep learning. Keeping the singular values of a layer's Jacobian bounded around 1 during training can help avoid the exploding/vanishing gradient problem and improve the generalizability of a neural network. We propose a new penalty function for a convolutional kernel so that the singular values of the corresponding transformation matrix are bounded around 1. We show how to carry out gradient-type methods for this penalty. The penalty is defined on the transformation matrix corresponding to a kernel, not directly on the kernel, which distinguishes it from results in existing papers. This provides a new regularization method for the weights of convolutional layers. Other penalty functions for a kernel can be devised following this idea in the future.
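To make the distinction between a kernel and its transformation matrix concrete, here is a minimal NumPy sketch. It is not the paper's method: it assumes a single-channel layer with periodic (circular) boundary conditions, and it uses the illustrative penalty ||M^T M - I||_F^2, which is zero exactly when all singular values of M equal 1. The function names `conv_matrix` and `orthogonality_penalty` are hypothetical.

```python
import numpy as np


def conv_matrix(kernel, n):
    """Build the (n*n, n*n) matrix M such that M @ x.ravel() equals the
    circular cross-correlation of an n x n input x with a k x k kernel.
    Assumption: periodic boundary, one input and one output channel."""
    k = kernel.shape[0]
    M = np.zeros((n * n, n * n))
    for i in range(n):            # output row
        for j in range(n):        # output column
            for a in range(k):    # kernel row
                for b in range(k):  # kernel column
                    r = (i + a) % n  # wrapped input row
                    c = (j + b) % n  # wrapped input column
                    M[i * n + j, r * n + c] += kernel[a, b]
    return M


def orthogonality_penalty(kernel, n):
    """Illustrative penalty ||M^T M - I||_F^2 on the transformation matrix
    (an assumed form, chosen because it vanishes iff all singular values are 1)."""
    M = conv_matrix(kernel, n)
    residual = M.T @ M - np.eye(n * n)
    return float(np.sum(residual ** 2))


def singular_values(kernel, n):
    """Singular values of the transformation matrix, in descending order."""
    return np.linalg.svd(conv_matrix(kernel, n), compute_uv=False)
```

Note that the penalty is a function of the kernel entries only through M: a 3 x 3 kernel has 9 free parameters, while M for a 4 x 4 input is a 16 x 16 matrix whose entries repeat those 9 values. A gradient-type method would differentiate the penalty with respect to the kernel entries, not with respect to the entries of M independently.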


research
07/25/2019

A Frobenius norm regularization method for convolutional kernels to avoid unstable gradient problem

Convolutional neural network is a very important model of deep learning....
research
02/05/2021

Regularization for convolutional kernel tensors to avoid unstable gradient problem in convolutional neural networks

Convolutional neural networks are very popular nowadays. Training neural...
research
03/19/2019

Kernel-based Translations of Convolutional Networks

Convolutional Neural Networks, as most artificial neural networks, are c...
research
07/25/2019

Filter Bank Regularization of Convolutional Neural Networks

Regularization techniques are widely used to improve the generality, rob...
research
06/10/2018

Transformationally Identical and Invariant Convolutional Neural Networks through Symmetric Element Operators

Mathematically speaking, a transformationally invariant operator, such a...
research
03/23/2016

A guide to convolution arithmetic for deep learning

We introduce a guide to help deep learning practitioners understand and ...
research
08/12/2021

Existence, Stability And Scalability Of Orthogonal Convolutional Neural Networks

Imposing orthogonal transformations between layers of a neural network h...
