Towards Practical Control of Singular Values of Convolutional Layers

11/24/2022
by   Alexandra Senderovich, et al.
0

In general, convolutional neural networks (CNNs) are easy to train, but their essential properties, such as generalization error and adversarial robustness, are hard to control. Recent research demonstrated that singular values of convolutional layers significantly affect such elusive properties and offered several methods for controlling them. Nevertheless, these methods present an intractable computational challenge or resort to coarse approximations. In this paper, we offer a principled approach to alleviating constraints of the prior art at the expense of an insignificant reduction in layer expressivity. Our method is based on the tensor-train decomposition; it retains control over the actual singular values of convolutional mappings while providing structurally sparse and hardware-friendly representation. We demonstrate the improved properties of modern CNNs with our method and analyze its impact on the model performance, calibration, and adversarial robustness. The source code is available at: https://github.com/WhiteTeaDragon/practical_svd_conv

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/17/2020

Large Norms of CNN Layers Do Not Hurt Adversarial Robustness

Since the Lipschitz properties of convolutional neural network (CNN) are...
research
05/26/2018

The Singular Values of Convolutional Layers

We characterize the singular values of the linear transformation associa...
research
05/25/2023

Efficient Bound of Lipschitz Constant for Convolutional Layers by Gram Iteration

Since the control of the Lipschitz constant has a great impact on the tr...
research
11/15/2022

Improved techniques for deterministic l2 robustness

Training convolutional neural networks (CNNs) with a strict 1-Lipschitz ...
research
04/14/2021

Orthogonalizing Convolutional Layers with the Cayley Transform

Recent work has highlighted several advantages of enforcing orthogonalit...
research
04/11/2021

TedNet: A Pytorch Toolkit for Tensor Decomposition Networks

Tensor Decomposition Networks(TDNs) prevail for their inherent compact a...
research
08/03/2023

Design Tasks and Their Complexity for Hybrid Level 3 of the European Train Control System

Railway networks have become increasingly important in recent times, esp...

Please sign up or login with your details

Forgot password? Click here to reset