Scalable Tucker Factorization for Sparse Tensors - Algorithms and Discoveries

10/06/2017
by   Sejoon Oh, et al.
0

Given sparse multi-dimensional data (e.g., (user, movie, time; rating) for movie recommendations), how can we discover latent concepts/relations and predict missing values? Tucker factorization has been widely used to solve such problems with multi-dimensional data, which are modeled as tensors. However, most Tucker factorization algorithms regard and estimate missing entries as zeros, which triggers a highly inaccurate decomposition. Moreover, few methods focusing on an accuracy exhibit limited scalability since they require huge memory and heavy computational costs while updating factor matrices. In this paper, we propose P-Tucker, a scalable Tucker factorization method for sparse tensors. P-Tucker performs an alternating least squares with a gradient-based update rule in a fully parallel way, which significantly reduces memory requirements for updating factor matrices. Furthermore, we offer two variants of P-Tucker: a caching algorithm P-Tucker-CACHE and an approximation algorithm P-Tucker-APPROX, both of which accelerate the update process. Experimental results show that P-Tucker exhibits 1.7-14.1x speed-up and 1.4-4.8x less error compared to the state-of-the-art. In addition, P-Tucker scales near linearly with the number of non-zeros in a tensor and number of threads. Thanks to P-Tucker, we successfully discover hidden concepts and relations in a large-scale real-world tensor, while existing methods cannot reveal latent features due to their limited scalability or low accuracy.

READ FULL TEXT
research
08/29/2017

Fast, Accurate, and Scalable Method for Sparse Coupled Matrix-Tensor Factorization

How can we capture the hidden properties from a tensor and a matrix data...
research
04/04/2019

VeST: Very Sparse Tucker Factorization of Large-Scale Tensors

Given a large tensor, how can we decompose it to sparse core tensor and ...
research
08/23/2021

Influence-guided Data Augmentation for Neural Tensor Completion

How can we predict missing values in multi-dimensional data (or tensors)...
research
12/16/2020

Time-Aware Tensor Decomposition for Missing Entry Prediction

Given a time-evolving tensor with missing entries, how can we effectivel...
research
03/07/2020

Efficient Nonnegative Tensor Factorization via Saturating Coordinate Descent

With the advancements in computing technology and web-based applications...
research
02/04/2018

Out-of-Core and Distributed Algorithms for Dense Subtensor Mining

How can we detect fraudulent lockstep behavior in large-scale multi-aspe...
research
02/23/2021

SliceNStitch: Continuous CP Decomposition of Sparse Tensor Streams

Consider traffic data (i.e., triplets in the form of source-destination-...

Please sign up or login with your details

Forgot password? Click here to reset