A network that learns Strassen multiplication

01/26/2016
by   Veit Elser, et al.
0

We study neural networks whose only non-linear components are multipliers, to test a new training rule in a context where the precise representation of data is paramount. These networks are challenged to discover the rules of matrix multiplication, given many examples. By limiting the number of multipliers, the network is forced to discover the Strassen multiplication rules. This is the mathematical equivalent of finding low rank decompositions of the n× n matrix multiplication tensor, M_n. We train these networks with the conservative learning rule, which makes minimal changes to the weights so as to give the correct output for each input at the time the input-output pair is received. Conservative learning needs a few thousand examples to find the rank 7 decomposition of M_2, and 10^5 for the rank 23 decomposition of M_3 (the lowest known). High precision is critical, especially for M_3, to discriminate between true decompositions and "border approximations".

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/02/2018

The geometry of rank decompositions of matrix multiplication II: 3× 3 matrices

This is the second in a series of papers on rank decompositions of the m...
research
02/11/2019

Equivalent Polyadic Decompositions of Matrix Multiplication Tensors

Invariance transformations of polyadic decompositions of matrix multipli...
research
11/29/2017

Energy-Efficient Time-Domain Vector-by-Matrix Multiplier for Neurocomputing and Beyond

We propose an extremely energy-efficient mixed-signal approach for perfo...
research
11/18/2019

New lower bounds for matrix multiplication and the 3x3 determinant

Let M_〈 u,v,w〉∈ C^uv⊗ C^vw⊗ C^wu denote the matrix multiplication tensor...
research
05/11/2017

Cache-oblivious Matrix Multiplication for Exact Factorisation

We present a cache-oblivious adaptation of matrix multiplication to be i...
research
10/17/2022

Data-driven Modeling of Mach-Zehnder Interferometer-based Optical Matrix Multipliers

Photonic integrated circuits are facilitating the development of optical...
research
09/24/2017

Tensor-Based Classifiers for Hyperspectral Data Analysis

In this work, we present tensor-based linear and nonlinear models for hy...

Please sign up or login with your details

Forgot password? Click here to reset