SimCD: Simultaneous Clustering and Differential expression analysis for single-cell transcriptomic data

04/04/2021
by   Seyednami Niyakan, et al.
0

Single-Cell RNA sequencing (scRNA-seq) measurements have facilitated genome-scale transcriptomic profiling of individual cells, with the hope of deconvolving cellular dynamic changes in corresponding cell sub-populations to better understand molecular mechanisms of different development processes. Several scRNA-seq analysis methods have been proposed to first identify cell sub-populations by clustering and then separately perform differential expression analysis to understand gene expression changes. Their corresponding statistical models and inference algorithms are often designed disjointly. We develop a new method – SimCD – that explicitly models cell heterogeneity and dynamic differential changes in one unified hierarchical gamma-negative binomial (hGNB) model, allowing simultaneous cell clustering and differential expression analysis for scRNA-seq data. Our method naturally defines cell heterogeneity by dynamic expression changes, which is expected to help achieve better performances on the two tasks compared to the existing methods that perform them separately. In addition, SimCD better models dropout (zero inflation) in scRNA-seq data by both cell- and gene-level factors and obviates the need for sophisticated pre-processing steps such as normalization, thanks to the direct modeling of scRNA-seq count data by the rigorous hGNB model with an efficient Gibbs sampling inference algorithm. Extensive comparisons with the state-of-the-art methods on both simulated and real-world scRNA-seq count data demonstrate the capability of SimCD to discover cell clusters and capture dynamic expression changes. Furthermore, SimCD helps identify several known genes affected by food deprivation in hypothalamic neuron cell subtypes as well as some new potential markers, suggesting the capability of SimCD for bio-marker discovery.

READ FULL TEXT

page 7

page 8

page 18

research
03/07/2018

Differential Expression Analysis of Dynamical Sequencing Count Data with a Gamma Markov Chain

Next-generation sequencing (NGS) to profile temporal changes in living s...
research
08/01/2019

Bayesian Gamma-Negative Binomial Modeling of Single-Cell RNA Sequencing Data

Background: Single-cell RNA sequencing (scRNA-seq) is a powerful profili...
research
12/17/2019

A nonparametric Bayesian approach to simultaneous subject and cell heterogeneity discovery for single cell RNA-seq data

The advent of the single cell sequencing era opens new avenues for the p...
research
10/25/2021

RZiMM-scRNA: A regularized zero-inflated mixture model framework for single-cell RNA-seq data

Applications of single-cell RNA sequencing in various biomedical researc...
research
02/26/2018

DropLasso: A robust variant of Lasso for single cell RNA-seq data

Single-cell RNA sequencing (scRNA-seq) is a fast growing approach to mea...
research
01/06/2022

Exponential family measurement error models for single-cell CRISPR screens

CRISPR genome engineering and single-cell RNA sequencing have transforme...
research
11/14/2022

Bayesian Reconstruction and Differential Testing of Excised mRNA

Characterizing the differential excision of mRNA is critical for underst...

Please sign up or login with your details

Forgot password? Click here to reset