A deep generative model for gene expression profiles from single-cell RNA sequencing

09/07/2017
by Romain Lopez, et al.

We propose a probabilistic model for interpreting gene expression levels that are observed through single-cell RNA sequencing. In the model, each cell has a low-dimensional latent representation. Additional latent variables account for technical effects that may erroneously set some observations of gene expression levels to zero. Conditional distributions are specified by neural networks, giving the proposed model enough flexibility to fit the data well. We use variational inference and stochastic optimization to approximate the posterior distribution. The inference procedure scales to over one million cells, whereas competing algorithms do not. Even on smaller datasets, the proposed procedure outperforms state-of-the-art methods such as ZIFA and ZINB-WaVE on several tasks. We also extend our framework to account for batch effects and other confounding factors, and propose a Bayesian hypothesis test for differential expression that outperforms DESeq2.
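
To make the generative story above concrete, here is a minimal sketch of how counts could be simulated from a model of this shape: a low-dimensional latent vector per cell, a small neural network mapping it to expected expression, and a second latent layer that zeroes out some entries to mimic technical dropout. Several details are assumptions for illustration and are not stated in the abstract: the negative binomial count distribution (sampled as a Gamma-Poisson mixture), the fixed `library_size` and `dispersion`, the single hidden layer, and the random, untrained weights. The function name `sample_cells` and all parameter names are hypothetical. Inference (the variational part) is not shown.

```python
import numpy as np


def sample_cells(n_cells, n_genes, n_latent=10, n_hidden=32,
                 library_size=1000.0, dispersion=2.0, seed=0):
    """Draw synthetic counts from a toy zero-inflated latent variable model."""
    rng = np.random.default_rng(seed)

    # Hypothetical decoder weights: random and untrained, scaled by fan-in
    # so that activations stay in a numerically comfortable range.
    w_hidden = rng.standard_normal((n_latent, n_hidden)) / np.sqrt(n_latent)
    w_mean = rng.standard_normal((n_hidden, n_genes)) / np.sqrt(n_hidden)
    w_drop = rng.standard_normal((n_hidden, n_genes)) / np.sqrt(n_hidden)

    # Low-dimensional latent representation of each cell.
    z = rng.standard_normal((n_cells, n_latent))

    # One hidden layer with ReLU, shared by both decoder heads.
    h = np.maximum(z @ w_hidden, 0.0)

    # Head 1: softmax over genes gives expression fractions per cell,
    # scaled by an assumed fixed library size to get expected counts.
    logits = h @ w_mean
    logits -= logits.max(axis=1, keepdims=True)
    frac = np.exp(logits)
    frac /= frac.sum(axis=1, keepdims=True)
    mean = library_size * frac

    # Assumed count model: negative binomial as a Gamma-Poisson mixture.
    rate = rng.gamma(shape=dispersion, scale=mean / dispersion)
    counts = rng.poisson(rate)

    # Head 2: per-entry dropout probability; dropped entries are set to zero.
    p_drop = 1.0 / (1.0 + np.exp(-(h @ w_drop)))
    dropped = rng.random((n_cells, n_genes)) < p_drop
    observed = np.where(dropped, 0, counts)

    return z, counts, observed


if __name__ == "__main__":
    z, counts, observed = sample_cells(n_cells=5, n_genes=20)
    print("latent shape:", z.shape)
    print("fraction of zeros before dropout:", (counts == 0).mean())
    print("fraction of zeros after dropout:", (observed == 0).mean())
```

Comparing `counts` with `observed` shows the role of the extra latent variables: every entry of `observed` either equals the underlying count or has been forced to zero by the dropout mask.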


