Robust Bayesian Tensor Factorization with Zero-Inflated Poisson Model and Consensus Aggregation

08/15/2023
by   Daniel Chafamo, et al.
0

Tensor factorizations (TF) are powerful tools for the efficient representation and analysis of multidimensional data. However, classic TF methods based on maximum likelihood estimation underperform when applied to zero-inflated count data, such as single-cell RNA sequencing (scRNA-seq) data. Additionally, the stochasticity inherent in TFs results in factors that vary across repeated runs, making interpretation and reproducibility of the results challenging. In this paper, we introduce Zero Inflated Poisson Tensor Factorization (ZIPTF), a novel approach for the factorization of high-dimensional count data with excess zeros. To address the challenge of stochasticity, we introduce Consensus Zero Inflated Poisson Tensor Factorization (C-ZIPTF), which combines ZIPTF with a consensus-based meta-analysis. We evaluate our proposed ZIPTF and C-ZIPTF on synthetic zero-inflated count data and synthetic and real scRNA-seq data. ZIPTF consistently outperforms baseline matrix and tensor factorization methods in terms of reconstruction accuracy for zero-inflated data. When the probability of excess zeros is high, ZIPTF achieves up to 2.4× better accuracy. Additionally, C-ZIPTF significantly improves the consistency and accuracy of the factorization. When tested on both synthetic and real scRNA-seq data, ZIPTF and C-ZIPTF consistently recover known and biologically meaningful gene expression programs.

READ FULL TEXT
research
12/12/2017

Zero-Modified Poisson-Lindley distribution with applications in zero-inflated and zero-deflated count data

The main object of this article is to present an extension of the zero-i...
research
10/12/2019

Variational Auto-encoder Based Bayesian Poisson Tensor Factorization for Sparse and Imbalanced Count Data

Non-negative tensor factorization models enable predictive analysis on c...
research
06/10/2015

Bayesian Poisson Tensor Factorization for Inferring Multilateral Relations from Sparse Dyadic Event Counts

We present a Bayesian tensor factorization model for inferring latent gr...
research
10/30/2019

Software defect prediction with zero-inflated Poisson models

In this work we apply several Poisson and zero-inflated models for softw...
research
09/29/2014

A Bayesian Tensor Factorization Model via Variational Inference for Link Prediction

Probabilistic approaches for tensor factorization aim to extract meaning...
research
08/18/2015

Zero-Truncated Poisson Tensor Factorization for Massive Binary Tensors

We present a scalable Bayesian model for low-rank factorization of massi...
research
05/20/2022

Hot-spots Detection in Count Data by Poisson Assisted Smooth Sparse Tensor Decomposition

Count data occur widely in many bio-surveillance and healthcare applicat...

Please sign up or login with your details

Forgot password? Click here to reset