NeuKron: Constant-Size Lossy Compression of Sparse Reorderable Matrices and Tensors

02/09/2023
by   Taehyung Kwon, et al.
0

Many real-world data are naturally represented as a sparse reorderable matrix, whose rows and columns can be arbitrarily ordered (e.g., the adjacency matrix of a bipartite graph). Storing a sparse matrix in conventional ways requires an amount of space linear in the number of non-zeros, and lossy compression of sparse matrices (e.g., Truncated SVD) typically requires an amount of space linear in the number of rows and columns. In this work, we propose NeuKron for compressing a sparse reorderable matrix into a constant-size space. NeuKron generalizes Kronecker products using a recurrent neural network with a constant number of parameters. NeuKron updates the parameters so that a given matrix is approximated by the product and reorders the rows and columns of the matrix to facilitate the approximation. The updates take time linear in the number of non-zeros in the input matrix, and the approximation of each entry can be retrieved in logarithmic time. We also extend NeuKron to compress sparse reorderable tensors (e.g. multi-layer graphs), which generalize matrices. Through experiments on ten real-world datasets, we show that NeuKron is (a) Compact: requiring up to five orders of magnitude less space than its best competitor with similar approximation errors, (b) Accurate: giving up to 10x smaller approximation error than its best competitors with similar size outputs, and (c) Scalable: successfully compressing a matrix with over 230 million non-zero entries.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/19/2023

TensorCodec: Compact Lossy Compression of Tensors without Strong Data Assumptions

Many real-world datasets are represented as tensors, i.e., multi-dimensi...
research
09/16/2021

Sparse Factorization of Large Square Matrices

Square matrices appear in many machine learning problems and models. Opt...
research
07/11/2019

Schatten Norms in Matrix Streams: Hello Sparsity, Goodbye Dimension

The spectrum of a matrix contains important structural information about...
research
11/03/2020

Near-Optimal Entrywise Sampling of Numerically Sparse Matrices

Many real-world data sets are sparse or almost sparse. One method to mea...
research
09/08/2020

Approximate Multiplication of Sparse Matrices with Limited Space

Approximate matrix multiplication with limited space has received ever-i...
research
07/02/2018

Multi-distance Support Matrix Machines

Real-world data such as digital images, MRI scans and electroencephalogr...
research
03/02/2021

Task-parallel in-situ temporal compression of large-scale computational fluid dynamics data

Present day computational fluid dynamics simulations generate extremely ...

Please sign up or login with your details

Forgot password? Click here to reset