SamBaTen: Sampling-based Batch Incremental Tensor Decomposition

09/03/2017
by   Ekta Gujral, et al.
0

Tensor decompositions are invaluable tools in analyzing multimodal datasets. In many real-world scenarios, such datasets are far from being static, to the contrary they tend to grow over time. For instance, in an online social network setting, as we observe new interactions over time, our dataset gets updated in its "time" mode. How can we maintain a valid and accurate tensor decomposition of such a dynamically evolving multimodal dataset, without having to re-compute the entire decomposition after every single update? In this paper we introduce SaMbaTen, a Sampling-based Batch Incremental Tensor Decomposition algorithm, which incrementally maintains the decomposition given new updates to the tensor dataset. SaMbaTen is able to scale to datasets that the state-of-the-art in incremental tensor decomposition is unable to operate on, due to its ability to effectively summarize the existing tensor and the incoming updates, and perform all computations in the reduced summary space. We extensively evaluate SaMbaTen using synthetic and real datasets. Indicatively, SaMbaTen achieves comparable accuracy to state-of-the-art incremental and non-incremental techniques, while being 25-30 times faster. Furthermore, SaMbaTen scales to very large sparse and dense dynamically evolving tensors of dimensions up to 100K x 100K x 100K where state-of-the-art incremental approaches were not able to operate.

READ FULL TEXT
research
06/11/2017

DenseAlert: Incremental Dense-Subtensor Detection in Tensor Streams

Consider a stream of retweet events - how can we spot fraudulent lock-st...
research
08/28/2019

Streaming and Batch Algorithms for Truss Decomposition

Truss decomposition is a method used to analyze large sparse graphs in o...
research
06/15/2021

Augmented Tensor Decomposition with Stochastic Optimization

Tensor decompositions are powerful tools for dimensionality reduction an...
research
05/26/2023

Sentence-Incremental Neural Coreference Resolution

We propose a sentence-incremental neural coreference resolution system w...
research
10/09/2017

CTD: Fast, Accurate, and Interpretable Method for Static and Dynamic Tensor Decompositions

How can we find patterns and anomalies in a tensor, or multi-dimensional...
research
10/10/2022

Modeling and Mining Multi-Aspect Graphs With Scalable Streaming Tensor Decomposition

Graphs emerge in almost every real-world application domain, ranging fro...
research
03/13/2017

SPARTan: Scalable PARAFAC2 for Large & Sparse Data

In exploratory tensor mining, a common problem is how to analyze a set o...

Please sign up or login with your details

Forgot password? Click here to reset