Practical Leverage-Based Sampling for Low-Rank Tensor Decomposition

06/30/2020
by   Brett W. Larsen, et al.
0

Conventional algorithms for finding low-rank canonical polyadic (CP) tensor decompositions are unwieldy for large sparse tensors. The CP decomposition can be computed by solving a sequence of overdetermined least problems with special Khatri-Rao structure. In this work, we present an application of randomized numerical linear algebra to fitting the CP decomposition of sparse tensors, solving a significantly smaller sampled least squares problem at each iteration with probabilistic guarantees on the approximation errors. Prior work has shown that sketching is effective in the dense case, but the prior approach cannot be applied to the sparse case because a fast Johnson-Lindenstrauss transform (e.g., using a fast Fourier transform) must be applied in each mode, causing the sparse tensor to become dense. Instead, we perform sketching through leverage score sampling, crucially relying on the fact that the structure of the Khatri-Rao product allows sampling from overestimates of the leverage scores without forming the full product or the corresponding probabilities. Naive application of leverage score sampling is ineffective because we often have cases where a few scores are quite large, leading to repeatedly sampling the few entries with large scores. We improve the speed by combining repeated rows. Additionally, we propose a novel hybrid of deterministic and random leverage-score sampling which consistently yields improved fits. Numerical results on real-world large-scale tensors show the method is significantly faster than competing methods without sacrificing accuracy.

READ FULL TEXT
research
04/02/2021

Fast and Accurate Randomized Algorithms for Low-rank Tensor Decompositions

Low-rank Tucker and CP tensor decompositions are powerful tools in data ...
research
07/18/2022

Tensor Decompositions for Count Data that Leverage Stochastic and Deterministic Optimization

There is growing interest to extend low-rank matrix decompositions to mu...
research
10/11/2022

Distributed-Memory Randomized Algorithms for Sparse Tensor CP Decomposition

Low-rank Candecomp / PARAFAC (CP) Decomposition is a powerful tool for t...
research
09/20/2017

Near Optimal Sketching of Low-Rank Tensor Regression

We study the least squares regression problem _Θ∈S_ D,RAΘ-b_2, where S_...
research
01/29/2023

Fast Exact Leverage Score Sampling from Khatri-Rao Products with Applications to Tensor Decomposition

We present a data structure to randomly sample rows from the Khatri-Rao ...
research
05/23/2021

Estimating leverage scores via rank revealing methods and randomization

We study algorithms for estimating the statistical leverage scores of re...
research
12/17/2019

Lower Memory Oblivious (Tensor) Subspace Embeddings with Fewer Random Bits: Modewise Methods for Least Squares

In this paper new general modewise Johnson-Lindenstrauss (JL) subspace e...

Please sign up or login with your details

Forgot password? Click here to reset