Convolutions Through the Lens of Tensor Networks

07/05/2023
by   Felix Dangel, et al.
0

Despite their simple intuition, convolutions are more tedious to analyze than dense layers, which complicates the generalization of theoretical and algorithmic ideas. We provide a new perspective onto convolutions through tensor networks (TNs) which allow reasoning about the underlying tensor multiplications by drawing diagrams, and manipulating them to perform function transformations, sub-tensor access, and fusion. We demonstrate this expressive power by deriving the diagrams of various autodiff operations and popular approximations of second-order information with full hyper-parameter support, batching, channel groups, and generalization to arbitrary convolution dimensions. Further, we provide convolution-specific transformations based on the connectivity pattern which allow to re-wire and simplify diagrams before evaluation. Finally, we probe computational performance, relying on established machinery for efficient TN contraction. Our TN implementation speeds up a recently-proposed KFAC variant up to 4.5x and enables new hardware-efficient tensor dropout for approximate backpropagation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/21/2021

Rotationally Equivariant Neural Operators for Learning Transformations on Tensor Fields (eg 3D Images and Vector Fields)

We introduce equivariant neural operators for learning resolution invari...
research
11/06/2019

TensorTrace: an application to contract tensor networks

Tensor network methods are a conceptually elegant framework for encoding...
research
04/14/2020

Tensor Network Rewriting Strategies for Satisfiability and Counting

We provide a graphical treatment of SAT and #SAT on equal footing. Insta...
research
02/11/2023

Optimizing Orthogonalized Tensor Deflation via Random Tensor Theory

This paper tackles the problem of recovering a low-rank signal tensor wi...
research
05/21/2018

Faster Neural Network Training with Approximate Tensor Operations

We propose a novel technique for faster Neural Network (NN) training by ...
research
06/29/2017

Tensor-based approach to accelerate deformable part models

This article provides next step towards solving speed bottleneck of any ...

Please sign up or login with your details

Forgot password? Click here to reset