RZiMM-scRNA: A regularized zero-inflated mixture model framework for single-cell RNA-seq data

10/25/2021
by Xinlei Mi, et al.

Applications of single-cell RNA sequencing (scRNA-seq) in various biomedical research areas have been blooming. This new technology provides unprecedented opportunities to study disease heterogeneity at the cellular level. However, unique characteristics of scRNA-seq data, including large dimensionality, high dropout rates, and possible batch effects, make such data difficult to analyze. Failing to address these issues appropriately obstructs true scientific discovery. Herein, we propose a unified Regularized Zero-inflated Mixture Model framework designed for scRNA-seq data (RZiMM-scRNA) that simultaneously detects cell subgroups and identifies differentially expressed genes based on a newly developed importance score, while accounting for both dropouts and batch effects. We conduct extensive simulation studies in which we evaluate the performance of RZiMM-scRNA and compare it with several popular methods, including Seurat, SC3, K-Means, and Hierarchical Clustering. Simulation results show that RZiMM-scRNA achieves superior clustering performance and more accurate biomarker detection than the alternative methods, especially when cell subgroups are less distinct, which supports the robustness of our method. Our empirical investigations focus on two brain tumor studies dealing with astrocytoma of various grades, including the most malignant of all brain tumors, glioblastoma multiforme (GBM). Our goal is to delineate cell heterogeneity and identify driving biomarkers associated with these tumors. Notably, RZiMM-scRNA successfully identifies a small group of oligodendrocyte cells, which has drawn much attention in the biomedical literature on brain cancers.

research
04/04/2021

SimCD: Simultaneous Clustering and Differential expression analysis for single-cell transcriptomic data

Single-Cell RNA sequencing (scRNA-seq) measurements have facilitated gen...
research
12/05/2022

Shared Differential Clustering across Single-cell RNA Sequencing Datasets with the Hierarchical Dirichlet Process

Single-cell RNA sequencing (scRNA-seq) is powerful technology that allow...
research
04/06/2017

DIMM-SC: A Dirichlet mixture model for clustering droplet-based single cell transcriptomic data

Motivation: Single cell transcriptome sequencing (scRNA-Seq) has become ...
research
08/01/2019

Bayesian Gamma-Negative Binomial Modeling of Single-Cell RNA Sequencing Data

Background: Single-cell RNA sequencing (scRNA-seq) is a powerful profili...
research
11/07/2022

Uncertainty Quantification for Atlas-Level Cell Type Transfer

Single-cell reference atlases are large-scale, cell-level maps that capt...
research
11/28/2022

Robust structured heterogeneity analysis approach for high-dimensional data

Revealing relationships between genes and disease phenotypes is a critic...