Fast, Accurate, and Scalable Method for Sparse Coupled Matrix-Tensor Factorization

08/29/2017
by   Dongjin Choi, et al.
0

How can we capture the hidden properties from a tensor and a matrix data simultaneously in a fast, accurate, and scalable way? Coupled matrix-tensor factorization (CMTF) is a major tool to extract latent factors from a tensor and matrices at once. Designing an accurate and efficient CMTF method has become more crucial as the size and dimension of real-world data are growing explosively. However, existing methods for CMTF suffer from lack of accuracy, slow running time, and limited scalability. In this paper, we propose S3CMTF, a fast, accurate, and scalable CMTF method. S3CMTF achieves high speed by exploiting the sparsity of real-world tensors, and high accuracy by capturing inter-relations between factors. Also, S3CMTF accomplishes additional speed-up by lock-free parallel SGD update for multi-core shared memory systems. We present two methods, S3CMTF-naive and S3CMTF-opt. S3CMTF-naive is a basic version of S3CMTF, and S3CMTF-opt improves its speed by exploiting intermediate data. We theoretically and empirically show that S3CMTF is the fastest, outperforming existing methods. Experimental results show that S3CMTF is 11 43 times faster, and 2.1 4.1 times more accurate than existing methods. S3CMTF shows linear scalability on the number of data entries and the number of cores. In addition, we apply S3CMTF to Yelp recommendation tensor data coupled with 3 additional matrices to discover interesting properties.

READ FULL TEXT

page 7

page 9

research
10/06/2017

Scalable Tucker Factorization for Sparse Tensors - Algorithms and Discoveries

Given sparse multi-dimensional data (e.g., (user, movie, time; rating) f...
research
04/04/2019

VeST: Very Sparse Tucker Factorization of Large-Scale Tensors

Given a large tensor, how can we decompose it to sparse core tensor and ...
research
02/28/2013

Scoup-SMT: Scalable Coupled Sparse Matrix-Tensor Factorization

How can we correlate neural activity in the human brain as it responds t...
research
10/09/2017

CTD: Fast, Accurate, and Interpretable Method for Static and Dynamic Tensor Decompositions

How can we find patterns and anomalies in a tensor, or multi-dimensional...
research
03/07/2020

Columnwise Element Selection for Computationally Efficient Nonnegative Coupled Matrix Tensor Factorization

Coupled Matrix Tensor Factorization (CMTF) facilitates the integration a...
research
04/27/2016

Distributed Flexible Nonlinear Tensor Factorization

Tensor factorization is a powerful tool to analyse multi-way data. Compa...
research
02/16/2021

Robust Factorization of Real-world Tensor Streams with Patterns, Missing Values, and Outliers

Consider multiple seasonal time series being collected in real-time, in ...

Please sign up or login with your details

Forgot password? Click here to reset