Kinetic foundation of the zero-inflated negative binomial model for single-cell RNA sequencing data

11/01/2019
by   Chen Jia, et al.

Single-cell RNA sequencing data have complex features such as dropout events, over-dispersion, and high-magnitude outliers, resulting in complicated probability distributions of mRNA abundances that are statistically characterized in terms of a zero-inflated negative binomial (ZINB) model. Here we provide a mesoscopic kinetic foundation of the widely used ZINB model based on the biochemical reaction kinetics underlying transcription. Using multiscale modeling and simplification techniques, we show that the ZINB distribution of mRNA abundance and the phenomenon of transcriptional bursting naturally emerge from a three-state stochastic transcription model. We further reveal a nontrivial quantitative relation between dropout events and transcriptional bursting, which provides novel insights into how and to what extent the burst size and burst frequency could reduce the dropout rate. Three different biophysical origins of over-dispersion are also clarified at the single-cell level.
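As a rough illustration of the distribution named in the abstract, the sketch below evaluates a ZINB probability mass function and its zero probability. The parameterization is an assumption chosen for illustration and is not taken from the paper: `pi` is the zero-inflation weight, `r` is the negative binomial shape (often read as a burst frequency relative to the mRNA degradation rate), and `b` is the mean burst size. The function names are hypothetical.

```python
import math


def nb_pmf(k, r, b):
    """Negative binomial pmf with shape r > 0 and mean burst size b > 0.

    Assumed form: P(k) = Gamma(k + r) / (k! Gamma(r)) * (b/(1+b))^k * (1/(1+b))^r,
    which has mean r * b. Computed in log space for numerical stability.
    """
    log_p = (
        math.lgamma(k + r) - math.lgamma(k + 1) - math.lgamma(r)
        + k * math.log(b / (1.0 + b))
        - r * math.log(1.0 + b)
    )
    return math.exp(log_p)


def zinb_pmf(k, pi, r, b):
    """ZINB pmf: a point mass at zero with weight pi, mixed with a negative binomial."""
    point_mass = pi if k == 0 else 0.0
    return point_mass + (1.0 - pi) * nb_pmf(k, r, b)


def zero_probability(pi, r, b):
    """Probability of observing zero counts under the assumed ZINB form."""
    return pi + (1.0 - pi) * (1.0 + b) ** (-r)


if __name__ == "__main__":
    # Illustrative parameter values only; they are not fitted to any data.
    for b in (1.0, 5.0, 20.0):
        print(f"b = {b}: P(0) = {zero_probability(0.2, 2.0, b):.4f}")
```

Under this assumed form, the negative binomial part of the zero probability, `(1 + b) ** (-r)`, shrinks as either `b` or `r` grows, while the zero-inflation weight `pi` sets a floor that bursting parameters alone cannot remove. The paper's own quantitative relation between dropout and bursting may differ from this simple expression.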
