A nonparametric Bayesian approach to simultaneous subject and cell heterogeneity discovery for single cell RNA-seq data

12/17/2019
by Qiuyu Wu, et al.

The advent of the single-cell sequencing era opens new avenues for personalized treatment. An important first step is to discover subject heterogeneity at single-cell resolution. In this article, we address the two-level clustering problem of simultaneous subject subgroup discovery (subject level) and cell type detection (cell level) based on scRNA-seq data from multiple subjects. Current statistical approaches, however, either cluster cells without considering subject heterogeneity or group subjects without using the single-cell information. To fill this gap between cell clustering and subject grouping, we develop a nonparametric Bayesian model, SCSC (Subject and Cell clustering for Single-Cell expression data), that achieves subject and cell grouping at the same time. SCSC does not require the number of subject subgroups or the number of cell types to be prespecified, automatically induces subject subgroup structures and matches cell types across subjects, and directly models the raw scRNA-seq count data by explicitly accounting for dropouts, library sizes, and over-dispersion. A computationally efficient blocked Gibbs sampler is proposed for posterior inference. A simulation study and an application to a multi-subject iPSC scRNA-seq dataset validate the ability of SCSC to discover subject and cell heterogeneity.
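
The three count-data features named in the abstract (dropouts, library sizes, over-dispersion) and the idea of not fixing the number of cell types in advance can be illustrated with a small simulation. What follows is a minimal sketch under stated assumptions, not the SCSC model itself: the abstract does not give the likelihood or the prior, so the truncated stick-breaking weights, the negative binomial counts, the expression-independent dropout, and every function and parameter name here (`stick_breaking_weights`, `simulate_counts`, `alpha`, `truncation`, `dispersion`, `dropout_prob`) are hypothetical choices made for illustration. It covers only the cell level and omits the subject-level grouping.

```python
import numpy as np


def stick_breaking_weights(alpha, truncation, rng):
    """Draw mixture weights from a truncated stick-breaking construction.

    Assumption: a truncated Dirichlet-process-style prior; the source
    abstract does not state which nonparametric prior SCSC uses.
    """
    betas = rng.beta(1.0, alpha, size=truncation)
    betas[-1] = 1.0  # close the stick so the weights sum to one
    remaining = np.concatenate(([1.0], np.cumprod(1.0 - betas[:-1])))
    return betas * remaining


def simulate_counts(n_cells, n_genes, alpha=1.0, truncation=10,
                    dispersion=2.0, dropout_prob=0.3, seed=0):
    """Simulate a cells-by-genes count matrix (illustrative only)."""
    rng = np.random.default_rng(seed)

    # Cell types are sampled from the stick-breaking weights, so the
    # number of occupied types is not fixed in advance.
    weights = stick_breaking_weights(alpha, truncation, rng)
    cell_type = rng.choice(truncation, size=n_cells, p=weights)

    # Hypothetical per-type mean expression profiles.
    type_means = rng.gamma(shape=2.0, scale=5.0, size=(truncation, n_genes))

    # Library size: a per-cell scaling factor on the mean.
    library_size = rng.uniform(0.5, 1.5, size=n_cells)
    mu = library_size[:, None] * type_means[cell_type]

    # Over-dispersion: negative binomial with mean mu and
    # variance mu + mu**2 / dispersion.
    p = dispersion / (dispersion + mu)
    counts = rng.negative_binomial(dispersion, p)

    # Dropouts: zero out entries at random (assumed independent of
    # expression level, which is a simplification).
    dropout = rng.random((n_cells, n_genes)) < dropout_prob
    counts[dropout] = 0
    return counts, cell_type, weights


if __name__ == "__main__":
    counts, cell_type, weights = simulate_counts(n_cells=50, n_genes=20)
    print("count matrix shape:", counts.shape)
    print("occupied cell types:", np.unique(cell_type).size)
    print("fraction of zeros:", float((counts == 0).mean()))
```

Fitting such a model would go in the opposite direction, inferring cell types and weights from the observed counts, which is the role the abstract assigns to the blocked Gibbs sampler.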

research
04/04/2021

SimCD: Simultaneous Clustering and Differential expression analysis for single-cell transcriptomic data

Single-Cell RNA sequencing (scRNA-seq) measurements have facilitated gen...
research
12/05/2022

Shared Differential Clustering across Single-cell RNA Sequencing Datasets with the Hierarchical Dirichlet Process

Single-cell RNA sequencing (scRNA-seq) is powerful technology that allow...
research
01/03/2020

Review of Single-cell RNA-seq Data Clustering for Cell Type Identification and Characterization

In recent years, the advances in single-cell RNA-seq techniques have ena...
research
03/04/2023

Stochastic networks theory to model single-cell genomic count data

We propose a novel way of representing and analysing single-cell genomic...
research
03/06/2020

Heterogeneity Loss to Handle Intersubject and Intrasubject Variability in Cancer

Developing nations lack adequate number of hospitals with modern equipme...
research
06/06/2021

Fisher-Pitman permutation tests based on nonparametric Poisson mixtures with application to single cell genomics

This paper investigates the theoretical and empirical performance of Fis...
research
07/15/2014

Automatic discovery of cell types and microcircuitry from neural connectomics

Neural connectomics has begun producing massive amounts of data, necessi...
