Visualizing hierarchies in scRNA-seq data using a density tree-biased autoencoder

02/11/2021
by   Quentin Garrido, et al.

Single-cell RNA sequencing (scRNA-seq) data makes it possible to study the development of cells at unparalleled resolution. Given that many cellular differentiation processes are hierarchical, their scRNA-seq data is expected to be approximately tree-shaped in gene expression space. Inferring this tree structure and representing it in two dimensions is highly desirable for biological interpretation and exploratory analysis. Our two contributions are an approach for identifying a meaningful tree structure from high-dimensional scRNA-seq data, and a visualization method that respects the tree structure. We extract the tree structure by means of a density-based minimum spanning tree on a vector quantization of the data and show that it captures biological information well. We then introduce DTAE, a tree-biased autoencoder that emphasizes the tree structure of the data in low-dimensional space. We compare DTAE to other dimension reduction methods and demonstrate the success of our method experimentally. Our implementation, which relies on PyTorch and Higra, is available at github.com/hci-unihd/DTAE.
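
To make the tree-extraction step more concrete, the following is a minimal sketch of a density-based spanning tree on a vector quantization. It is not the authors' code and uses only the Python standard library rather than PyTorch or Higra. Several choices are assumptions made for illustration: plain k-means as the vector quantization, the definition of edge density as the number of points whose two nearest centroids are the endpoints of that edge, the toy Y-shaped data, and all function names. Selecting edges in order of decreasing density is equivalent to a minimum spanning tree on inverse-density weights.

```python
import math
import random
from collections import Counter


def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's k-means; stands in for the vector quantization step."""
    rng = random.Random(seed)
    centroids = [list(p) for p in rng.sample(points, k)]
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for p in points:
            j = min(range(k), key=lambda c: math.dist(p, centroids[c]))
            clusters[j].append(p)
        for j, members in enumerate(clusters):
            if members:  # an empty cluster keeps its previous centroid
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return centroids


def edge_densities(points, centroids):
    """Assumed density: count points whose two nearest centroids are (a, b)."""
    counts = Counter()
    for p in points:
        order = sorted(range(len(centroids)),
                       key=lambda c: math.dist(p, centroids[c]))
        a, b = sorted(order[:2])
        counts[(a, b)] += 1
    return counts


def density_spanning_forest(n_nodes, densities):
    """Kruskal's algorithm, taking the densest edges first.

    Equivalent to a minimum spanning tree on weights 1 / density.
    Returns a forest if the density graph is not connected.
    """
    parent = list(range(n_nodes))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    tree = []
    for (a, b), d in sorted(densities.items(), key=lambda kv: -kv[1]):
        ra, rb = find(a), find(b)
        if ra != rb:  # adding this edge does not create a cycle
            parent[ra] = rb
            tree.append((a, b, d))
    return tree


# Toy Y-shaped data: three noisy branches leaving a common origin.
rng = random.Random(1)
arms = [(1.0, 0.0), (-0.5, 0.87), (-0.5, -0.87)]
points = []
for dx, dy in arms:
    for _ in range(100):
        t = rng.random()
        points.append((t * dx + rng.gauss(0, 0.03), t * dy + rng.gauss(0, 0.03)))

centroids = kmeans(points, k=9)
densities = edge_densities(points, centroids)
tree = density_spanning_forest(len(centroids), densities)

for a, b, d in tree:
    print(f"centroid {a} -- centroid {b}  (density {d})")
```

The sketch covers only the tree extraction. The DTAE autoencoder itself, which biases a low-dimensional embedding toward this tree, is not shown here.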

