Canonical convolutional neural networks

06/03/2022
by Moritz Wolter, et al.

We introduce canonical weight normalization for convolutional neural networks. Inspired by the canonical tensor decomposition, we express the weight tensors of so-called canonical networks as scaled sums of outer vector products. In particular, we train the network weights in decomposed form, with the scale weights optimized separately for each mode. As in weight normalization, we additionally include a global scaling parameter. We study the initialization of the canonical form by running the power method and by drawing randomly from Gaussian or uniform distributions. Our results indicate that the power method can be replaced with cheaper initializations drawn from standard distributions. The canonical re-parametrization leads to competitive normalization performance on the MNIST, CIFAR10, and SVHN data sets. Moreover, the formulation simplifies network compression: once training has converged, the canonical form allows convenient model compression by truncating the parameter sums.
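The idea of a weight tensor written as a scaled sum of outer vector products, and of compression by truncating that sum, can be illustrated with a small NumPy sketch. This is not the authors' implementation: the function names `canonical_weight` and `truncate`, the unit-norm treatment of the factor vectors, the way per-mode scales are multiplied together, and the magnitude-based choice of which terms to keep are all assumptions made for illustration, since the abstract does not specify these details.

```python
import numpy as np

def canonical_weight(factors, mode_scales, g):
    """Assemble a 4-D convolution kernel from a canonical (CP-style) form.

    factors:     four arrays of shape (dim_k, R), one per tensor mode
    mode_scales: four arrays of shape (R,), one scale vector per mode (assumed layout)
    g:           global scalar scale, analogous to weight normalization
    """
    # Assumption: factor vectors are normalized to unit length, so the
    # magnitude of each term is carried by the scale parameters only.
    unit = [f / np.linalg.norm(f, axis=0, keepdims=True) for f in factors]
    a, b, c, d = [u * s for u, s in zip(unit, mode_scales)]
    # Sum over r of the outer products a_r (x) b_r (x) c_r (x) d_r.
    return g * np.einsum("or,ir,hr,wr->oihw", a, b, c, d)

def truncate(factors, mode_scales, keep):
    """Keep the `keep` terms with the largest combined scale (hypothetical criterion)."""
    magnitude = np.abs(np.prod(np.stack(mode_scales), axis=0))
    idx = np.argsort(magnitude)[::-1][:keep]
    return [f[:, idx] for f in factors], [s[idx] for s in mode_scales]

# Random Gaussian initialization of the factors, one of the cheap options
# the abstract mentions; shapes and rank are arbitrary example values.
rng = np.random.default_rng(0)
shape, rank = (8, 3, 5, 5), 6   # (out channels, in channels, height, width)
factors = [rng.normal(size=(n, rank)) for n in shape]
mode_scales = [np.ones(rank) for _ in shape]
g = 1.0

W = canonical_weight(factors, mode_scales, g)
small_factors, small_scales = truncate(factors, mode_scales, keep=3)
W_small = canonical_weight(small_factors, small_scales, g)
```

In this sketch the truncated kernel `W_small` has the same shape as `W` but is built from half as many rank-one terms, which is the sense in which truncating the parameter sums compresses the model.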

