Finite mixtures of matrix-variate Poisson-log normal distributions for three-way count data

07/22/2018
by Anjali Silva, et al.

Three-way data structures, characterized by three entities (the units, the variables, and the occasions), are frequent in biological studies. In RNA sequencing, three-way data structures are obtained when high-throughput transcriptome sequencing data are collected for n genes across p conditions at r occasions. Matrix-variate distributions offer a natural way to model three-way data, and mixtures of matrix-variate distributions can be used to cluster three-way data. Clustering of gene expression data is carried out as a means of discovering gene co-expression networks. In this work, a mixture of matrix-variate Poisson-log normal distributions is proposed for clustering read counts from RNA sequencing. By accounting for the matrix-variate structure, the model uses the full information on the conditions and occasions of the RNA sequencing dataset simultaneously and reduces the number of covariance parameters to be estimated. A Markov chain Monte Carlo expectation-maximization algorithm is used for parameter estimation, and information criteria are used for model selection. The models are applied to both real and simulated data, giving favourable clustering results.
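The abstract does not spell out the model, so the following is only a minimal sketch under a commonly used formulation, assumed here rather than taken from the paper: each gene's r x p count matrix has entries that are Poisson with log-rates drawn from a matrix normal distribution with mean M, an r x r occasion covariance Phi, and a p x p condition covariance Omega. The function names, the parameter values, and the omission of any library-size normalization are all illustrative assumptions. The second function illustrates why the matrix-variate structure reduces the number of covariance parameters compared with an unstructured covariance on the vectorized rp-dimensional data.

```python
import numpy as np

def sample_mvpln(n, M, Phi, Omega, rng):
    """Draw n count matrices (each r x p) from an ASSUMED matrix-variate
    Poisson-log normal: theta ~ MatrixNormal(M, Phi, Omega),
    Y_ij | theta_ij ~ Poisson(exp(theta_ij))."""
    r, p = M.shape
    A = np.linalg.cholesky(Phi)    # Phi = A A^T, covariance across occasions
    B = np.linalg.cholesky(Omega)  # Omega = B B^T, covariance across conditions
    Z = rng.standard_normal((n, r, p))
    theta = M + A @ Z @ B.T        # matrix normal draw: M + A Z B^T
    return rng.poisson(np.exp(theta))

def covariance_param_counts(r, p):
    """Free covariance parameters: unstructured (rp x rp) versus the
    Kronecker-structured pair (r x r and p x p). The scale
    identifiability constraint between Phi and Omega is ignored here."""
    full = (r * p) * (r * p + 1) // 2
    kron = r * (r + 1) // 2 + p * (p + 1) // 2
    return full, kron

# Hypothetical example: 5 genes, r = 2 occasions, p = 3 conditions.
rng = np.random.default_rng(0)
M = np.full((2, 3), 2.0)
Phi = np.array([[1.0, 0.3], [0.3, 1.0]])
Omega = 0.5 * np.eye(3)
counts = sample_mvpln(5, M, Phi, Omega, rng)
full, kron = covariance_param_counts(2, 3)
```

With r = 2 and p = 3, the unstructured covariance has 21 free parameters, whereas the two smaller matrices together have 9; the gap widens quickly as r and p grow.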


research
11/30/2017

A Multivariate Poisson-Log Normal Mixture Model for Clustering Transcriptome Sequencing Data

High-dimensional data of discrete and skewed nature is commonly encounte...
research
05/08/2020

Mixtures of Contaminated Matrix Variate Normal Distributions

Analysis of three-way data is becoming ever more prevalent in the litera...
research
04/15/2020

A parsimonious family of multivariate Poisson-lognormal distributions for clustering multivariate count data

Multivariate count data are commonly encountered through high-throughput...
research
06/02/2019

Clustering Multivariate Data using Factor Analytic Bayesian Mixtures with an Unknown Number of Components

Recent work on overfitting Bayesian mixtures of distributions offers a p...
research
11/29/2021

Model-based clustering via skewed matrix-variate cluster-weighted models

Cluster-weighted models (CWMs) extend finite mixtures of regressions (FM...
research
07/20/2023

Sparse model-based clustering of three-way data via lasso-type penalties

Mixtures of matrix Gaussian distributions provide a probabilistic framew...
research
08/01/2019

Conditional Finite Mixtures of Poisson Distributions for Context-Dependent Neural Correlations

Parallel recordings of neural spike counts have revealed the existence o...
