
A mathematical framework for raw counts of single-cell RNA-seq data analysis

by Silvia Giulia Galfre', et al.

Single-cell RNA-seq data are challenging because of the sparseness of the read counts, the low expression levels of many relevant genes, and the variability in RNA extraction efficiency across cells. We consider a simple probabilistic model for read counts, based on a negative binomial distribution for each gene, modified by a cell-dependent coefficient interpreted as an extraction efficiency. We provide two alternative fast methods to estimate the model parameters, together with the probability that a cell yields zero read counts for a gene. This allows gene co-expression and differential expression to be measured in a novel way.
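
To make the zero-count probability concrete, here is a minimal Python sketch. It rests on assumptions that the abstract does not spell out: the negative binomial is parameterized by a mean and a dispersion `a` (variance = mean + a * mean^2), and the cell-dependent efficiency `nu` rescales the gene's mean `lam` multiplicatively. The names `lam`, `nu`, `a`, `nb_zero_prob`, and `expected_zero_cells` are hypothetical and chosen only for illustration; this is not the authors' estimation procedure.

```python
import math


def nb_zero_prob(lam, nu, a):
    """Probability of observing zero reads for one gene in one cell.

    Assumed model (illustrative only): the count is negative binomial with
    mean mu = nu * lam and dispersion a, so that variance = mu + a * mu**2.
    Under that parameterization, P(count = 0) = (1 + a * mu) ** (-1 / a).
    """
    mu = nu * lam  # assumed: efficiency scales the gene mean multiplicatively
    if a == 0:
        # Poisson limit of the negative binomial as the dispersion vanishes
        return math.exp(-mu)
    return (1.0 + a * mu) ** (-1.0 / a)


def expected_zero_cells(lam, a, efficiencies):
    """Expected number of cells with zero reads for a gene.

    Sums the per-cell zero probabilities over the given efficiencies.
    """
    return sum(nb_zero_prob(lam, nu, a) for nu in efficiencies)
```

Under these assumptions, a lower extraction efficiency raises the chance of a zero count for the same gene, which is why the per-cell coefficient matters when comparing observed zeros against what the model expects.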


Structure learning for zero-inflated counts, with an application to single-cell RNA sequencing data

The problem of estimating the structure of a graph from observed data is...

Non-parametric Bayesian modelling of digital gene expression data

Next-generation sequencing technologies provide a revolutionary tool for...

Optirank: classification for RNA-Seq data with optimal ranking reference genes

Classification algorithms using RNA-Sequencing (RNA-Seq) data as input a...

Inferring Microbial Biomass Yield and Cell Weight using Probabilistic Macrochemical Modeling

Growth rates and biomass yields are key descriptors used in microbiology...

A deep generative model for single-cell RNA sequencing with application to detecting differentially expressed genes

We propose a probabilistic model for interpreting gene expression levels...

Multiscale methods for signal selection in single-cell data

Analysis of single-cell transcriptomics often relies on clustering cells...

High-dimensional Log-Error-in-Variable Regression with Applications to Microbial Compositional Data Analysis

In microbiome and genomic study, the regression of compositional data ha...