Sparse Inverse Covariance Estimation for High-throughput microRNA Sequencing Data in the Poisson Log-Normal Graphical Model

08/15/2017
by   David Sinclair, et al.
0

We introduce the Poisson Log-Normal Graphical Model for count data, and present a normality transformation for data arising from this distribution. The model and transformation are feasible for high-throughput microRNA (miRNA) sequencing data and directly account for known overdispersion relationships present in this data set. The model allows for network dependencies to be modeled, and we provide an algorithm which utilizes a one-step EM based result in order to allow for a provable increase in performance in determining the network structure. The model is shown to provide an increase in performance in simulation settings over a range of network structures. The model is applied to high-throughput miRNA sequencing data from patients with breast cancer from The Cancer Genome Atlas (TCGA). By selecting the most highly connected miRNA molecules in the fitted network we find that nearly all of them are known to be involved in the regulation of breast cancer.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/24/2017

Automatic breast cancer grading in lymph nodes using a deep neural network

The progression of breast cancer can be quantified in lymph node whole-s...
research
05/05/2020

A Pipeline for Integrated Theory and Data-Driven Modeling of Genomic and Clinical Data

High throughput genome sequencing technologies such as RNA-Seq and Micro...
research
06/26/2018

Bayesian Multi-study Factor Analysis for High-throughput Biological Data

This paper presents a new modeling strategy for joint unsupervised analy...
research
05/08/2020

The scalable Birth-Death MCMC Algorithm for Mixed Graphical Model Learning with Application to Genomic Data Integration

Recent advances in biological research have seen the emergence of high-t...
research
10/22/2018

Bayesian multi-domain learning for cancer subtype discovery from next-generation sequencing count data

Precision medicine aims for personalized prognosis and therapeutics by u...
research
03/20/2014

Network-based Isoform Quantification with RNA-Seq Data for Cancer Transcriptome Analysis

High-throughput mRNA sequencing (RNA-Seq) is widely used for transcript ...
research
11/30/2017

A Multivariate Poisson-Log Normal Mixture Model for Clustering Transcriptome Sequencing Data

High-dimensional data of discrete and skewed nature is commonly encounte...

Please sign up or login with your details

Forgot password? Click here to reset