COPA: Constrained PARAFAC2 for Sparse & Large Datasets

03/12/2018
by Ardavan Afshar, et al.

PARAFAC2 has demonstrated success in modeling irregular tensors, where the tensor dimensions vary across one of the modes. An example scenario is jointly modeling treatments across a set of patients with varying numbers of medical encounters, where the alignment of events in time bears no clinical meaning and may even be impossible because the records differ in length. Despite recent improvements in scaling up unconstrained PARAFAC2, its model factors are usually dense and sensitive to noise, which limits their interpretability. As a result, the following open challenges remain: a) various modeling constraints, such as temporal smoothness, sparsity, and non-negativity, need to be imposed for interpretable temporal modeling, and b) a scalable approach is required to support those constraints efficiently for large datasets. To tackle these challenges, we propose a COnstrained PARAFAC2 (COPA) method, which carefully incorporates optimization constraints such as temporal smoothness, sparsity, and non-negativity in the resulting factors. To efficiently support all those constraints, COPA adopts a hybrid optimization framework using alternating optimization and the alternating direction method of multipliers (AO-ADMM). As evaluated on large electronic health record (EHR) datasets with hundreds of thousands of patients, COPA achieves significant speedups (up to 36x faster) over prior PARAFAC2 approaches that only attempt to handle a subset of the constraints that COPA enables. Overall, our method outperforms all the baselines attempting to handle a subset of the constraints in terms of speed, while achieving the same level of accuracy.
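To make the AO-ADMM idea more concrete, the sketch below shows a generic ADMM inner loop for a single factor update under a non-negativity constraint, i.e. approximately solving min_H 0.5*||Y - W H||_F^2 subject to H >= 0. It is not taken from the COPA paper or its code: the function name `admm_nonneg_ls`, the choice `rho = trace(W^T W) / k`, and the fixed iteration count are illustrative assumptions, and the smoothness and sparsity constraints mentioned in the abstract are not covered.

```python
import numpy as np

def admm_nonneg_ls(W, Y, n_iter=200):
    """Hypothetical helper: ADMM for min_H 0.5*||Y - W @ H||_F^2 s.t. H >= 0."""
    k = W.shape[1]
    G = W.T @ W                      # k x k Gram matrix
    F = W.T @ Y                      # k x n right-hand side
    rho = np.trace(G) / k            # assumed penalty parameter (a common heuristic)
    A = G + rho * np.eye(k)          # system matrix of the least-squares step
    H = np.zeros((k, Y.shape[1]))    # constrained copy of the factor
    U = np.zeros_like(H)             # scaled dual variable
    for _ in range(n_iter):
        # Least-squares step on the auxiliary (unconstrained) variable.
        H_aux = np.linalg.solve(A, F + rho * (H - U))
        # Projection onto the non-negative orthant enforces the constraint.
        H = np.maximum(0.0, H_aux + U)
        # Dual update accumulates the disagreement between the two copies.
        U = U + H_aux - H
    return H

# Small synthetic check: recover a non-negative factor from noiseless data.
rng = np.random.default_rng(0)
W = rng.normal(size=(30, 4))
H_true = np.abs(rng.normal(size=(4, 6)))
H_est = admm_nonneg_ls(W, W @ H_true)
print("max abs error:", np.abs(H_est - H_true).max())
```

In an alternating-optimization scheme, a loop of this kind would be run for one factor at a time while the others are held fixed; swapping the projection line for a different proximal operator is how other constraints would typically be plugged in.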


research
03/13/2017

SPARTan: Scalable PARAFAC2 for Large & Sparse Data

In exploratory tensor mining, a common problem is how to analyze a set o...
research
10/24/2022

PARAFAC2-based Coupled Matrix and Tensor Factorizations

Coupled matrix and tensor factorizations (CMTF) have emerged as an effec...
research
12/16/2020

Time-Aware Tensor Decomposition for Missing Entry Prediction

Given a time-evolving tensor with missing entries, how can we effectivel...
research
05/28/2021

Revitalizing Optimization for 3D Human Pose and Shape Estimation: A Sparse Constrained Formulation

We propose a novel sparse constrained formulation and from it derive a r...
research
04/11/2017

Federated Tensor Factorization for Computational Phenotyping

Tensor factorization models offer an effective approach to convert massi...
research
06/13/2015

A Flexible and Efficient Algorithmic Framework for Constrained Matrix and Tensor Factorization

We propose a general algorithmic framework for constrained matrix and te...
research
10/17/2019

Generalized Mixed Modeling in Massive Electronic Health Record Databases: what is a healthy serum potassium?

Converting electronic health record (EHR) entries to useful clinical inf...
