Scoup-SMT: Scalable Coupled Sparse Matrix-Tensor Factorization

02/28/2013
by Evangelos E. Papalexakis, et al.

How can we correlate neural activity in the human brain, as it responds to words, with behavioral data expressed as answers to questions about these same words? In short, we want to find latent variables that explain both the brain activity and the behavioral responses. We show that this is an instance of the Coupled Matrix-Tensor Factorization (CMTF) problem. We propose Scoup-SMT, a novel, fast, and parallel algorithm that solves the CMTF problem and produces a sparse latent low-rank subspace of the data. In our experiments, Scoup-SMT is 50-100 times faster than a state-of-the-art algorithm for CMTF and yields a 5-fold increase in sparsity. Moreover, we extend Scoup-SMT to handle missing data without degradation of performance. We apply Scoup-SMT to BrainQ, a dataset consisting of a (nouns, brain voxels, human subjects) tensor and a (nouns, properties) matrix, coupled along the nouns dimension. Scoup-SMT finds meaningful latent variables and predicts brain activity with competitive accuracy. Finally, we demonstrate the generality of Scoup-SMT by applying it to a Facebook dataset (users, friends, wall-postings), where it spots spammer-like anomalies.
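To make the CMTF problem statement concrete, here is a minimal sketch of a plain alternating-least-squares (ALS) baseline in NumPy. It is not the Scoup-SMT algorithm, whose sampling, parallelism, and sparsity mechanisms the abstract does not spell out. It only illustrates the coupled objective: a tensor X (for example nouns x voxels x subjects) and a matrix Y (nouns x properties) share the factor A along the first mode. The function names `cmtf_als` and `cmtf_loss`, the rank, and the iteration count are assumptions chosen for illustration.

```python
import numpy as np


def cmtf_loss(X, Y, A, B, C, D):
    """Sum of squared errors of the coupled model: X ~ [[A, B, C]], Y ~ A D^T."""
    X_hat = np.einsum("ir,jr,kr->ijk", A, B, C)
    return np.sum((X - X_hat) ** 2) + np.sum((Y - A @ D.T) ** 2)


def cmtf_als(X, Y, rank, n_iter=50, seed=0):
    """Plain ALS for coupled matrix-tensor factorization (illustrative baseline).

    X: array of shape (I, J, K); Y: array of shape (I, M).
    The first mode (size I) is the coupled one, so A is shared by both models.
    """
    rng = np.random.default_rng(seed)
    I, J, K = X.shape
    M = Y.shape[1]
    A = rng.standard_normal((I, rank))
    B = rng.standard_normal((J, rank))
    C = rng.standard_normal((K, rank))
    D = rng.standard_normal((M, rank))

    for _ in range(n_iter):
        # Shared factor: the normal equations combine tensor and matrix terms.
        gram = (B.T @ B) * (C.T @ C) + D.T @ D
        rhs = np.einsum("ijk,jr,kr->ir", X, B, C) + Y @ D
        A = rhs @ np.linalg.pinv(gram)

        # Tensor-only factors: standard CP-ALS updates.
        B = np.einsum("ijk,ir,kr->jr", X, A, C) @ np.linalg.pinv((A.T @ A) * (C.T @ C))
        C = np.einsum("ijk,ir,jr->kr", X, A, B) @ np.linalg.pinv((A.T @ A) * (B.T @ B))

        # Matrix-only factor: ordinary least squares against the shared A.
        D = Y.T @ A @ np.linalg.pinv(A.T @ A)

    return A, B, C, D


if __name__ == "__main__":
    rng = np.random.default_rng(42)
    X = rng.standard_normal((8, 6, 5))   # stand-in for (nouns, voxels, subjects)
    Y = rng.standard_normal((8, 4))      # stand-in for (nouns, properties)
    A, B, C, D = cmtf_als(X, Y, rank=3)
    print("coupled loss:", cmtf_loss(X, Y, A, B, C, D))
```

Each update solves an exact least-squares subproblem, so the coupled loss does not increase from one step to the next. This dense baseline produces neither sparse factors nor any handling of missing entries, which are the properties the abstract claims for Scoup-SMT.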

research
08/02/2019

CVC4-SymBreak: Derived SMT solver at SMT Competition 2019

We present CVC4-SymBreak, a derived SMT solver based on CVC4, and a non-...
research
08/29/2017

Fast, Accurate, and Scalable Method for Sparse Coupled Matrix-Tensor Factorization

How can we capture the hidden properties from a tensor and a matrix data...
research
03/05/2018

OpenMath and SMT-LIB

OpenMath and SMT-LIB are languages with very different origins, but both...
research
03/07/2020

Columnwise Element Selection for Computationally Efficient Nonnegative Coupled Matrix Tensor Factorization

Coupled Matrix Tensor Factorization (CMTF) facilitates the integration a...
research
10/13/2011

Efficient Latent Variable Graphical Model Selection via Split Bregman Method

We consider the problem of covariance matrix estimation in the presence ...
research
05/23/2023

SMT 2.0: A Surrogate Modeling Toolbox with a focus on Hierarchical and Mixed Variables Gaussian Processes

The Surrogate Modeling Toolbox (SMT) is an open-source Python package th...
research
11/11/2018

Fast Matrix Factorization with Non-Uniform Weights on Missing Data

Matrix factorization (MF) has been widely used to discover the low-rank ...
