Streaming Coresets for Symmetric Tensor Factorization

06/01/2020
by   Rachit Chhaya, et al.

Factorizing tensors has recently become an important optimization module in a number of machine learning pipelines, especially in latent variable models. We show how to do this efficiently in the streaming setting. Given a set of n vectors, each in ℝ^d, we present algorithms to select a sublinear number of these vectors as a coreset, while guaranteeing that the CP decomposition of the p-moment tensor of the coreset approximates the corresponding decomposition of the p-moment tensor computed from the full data. We introduce two novel algorithmic techniques: online filtering and kernelization. Using these two techniques, we present four algorithms that achieve different tradeoffs among coreset size, update time, and working space, beating or matching various state-of-the-art algorithms. In the case of matrices (order-2 tensors), our online row sampling algorithm guarantees a (1 ± ϵ) relative error spectral approximation. We show applications of our algorithms to learning single topic models.
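To make the matrix case (p = 2) concrete, the following is a minimal sketch of online row sampling, not the algorithms of the paper. It assumes a generic scheme: each arriving row is scored by its leverage against the covariance of the rows seen so far, kept with probability proportional to that score, and reweighted by the inverse probability. The function names, the `oversample` and `ridge` parameters, and the synthetic Gaussian data are all hypothetical choices made for illustration.

```python
import numpy as np

def online_row_sample(rows, oversample=40.0, ridge=1e-6, seed=0):
    """One pass over the rows; returns kept indices and their weights.

    Illustrative only: scores are online leverage scores against the
    prefix covariance, which is an assumption of this sketch.
    """
    rng = np.random.default_rng(seed)
    d = rows.shape[1]
    cov = ridge * np.eye(d)          # small ridge keeps the solve well-posed
    kept, weights = [], []
    for i, a in enumerate(rows):
        cov += np.outer(a, a)        # covariance of rows 0..i
        score = float(a @ np.linalg.solve(cov, a))
        prob = min(1.0, oversample * score)
        if rng.random() < prob:
            kept.append(i)
            weights.append(1.0 / prob)   # inverse-probability weight
    return np.array(kept, dtype=int), np.array(weights)

def moment_matrix(rows, weights=None):
    """Weighted 2-moment tensor: sum_i w_i * a_i a_i^T."""
    w = np.ones(len(rows)) if weights is None else weights
    return (rows * w[:, None]).T @ rows

def relative_spectral_error(full, approx):
    """Spectral norm of full^{-1/2} approx full^{-1/2} - I."""
    evals, evecs = np.linalg.eigh(full)
    inv_sqrt = evecs @ np.diag(evals ** -0.5) @ evecs.T
    diff = inv_sqrt @ approx @ inv_sqrt - np.eye(full.shape[0])
    return float(np.linalg.norm(diff, 2))

# Usage on synthetic data (hypothetical sizes).
data = np.random.default_rng(0).normal(size=(2000, 5))
idx, w = online_row_sample(data)
err = relative_spectral_error(moment_matrix(data), moment_matrix(data[idx], w))
print(f"kept {len(idx)} of {len(data)} rows, relative spectral error {err:.3f}")
```

A relative spectral error of ϵ here means that, for every direction x, the weighted sum of ⟨a_i, x⟩² over the sampled rows lies within a (1 ± ϵ) factor of the same sum over all rows, which is the form of guarantee the abstract states for order-2 tensors. Raising `oversample` keeps more rows and typically lowers the error.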


