Visualizing structure and transitions in high-dimensional biological data

09/16/2020
by Smita Krishnaswamy, et al.

The high-dimensional data created by high-throughput technologies require visualization tools that reveal data structure and patterns in an intuitive form. We present PHATE, a visualization method that captures both local and global nonlinear structure using an information-geometric distance between data points. We compare PHATE to other tools on a variety of artificial and biological datasets, and find that it consistently preserves a range of patterns in data, including continual progressions, branches and clusters, better than other tools. We define a manifold preservation metric, which we call denoised embedding manifold preservation (DEMaP), and show that PHATE produces lower-dimensional embeddings that are quantitatively better denoised than those produced by existing visualization methods. An analysis of a newly generated single-cell RNA sequencing dataset on human germ-layer differentiation demonstrates how PHATE reveals unique biological insight into the main developmental branches, including the identification of three previously undescribed subpopulations. We also show that PHATE is applicable to a wide variety of data types, including mass cytometry, single-cell RNA sequencing, Hi-C and gut microbiome data.
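
As a rough illustration of what a diffusion-based, information-geometric distance between data points can look like, the sketch below builds a Markov transition matrix from Gaussian affinities, diffuses it for a fixed number of steps, log-transforms the result, and compares points by the Euclidean distance between their log-transformed rows. The abstract does not specify these details: the function name `potential_distances`, the fixed bandwidth `sigma`, the step count `t`, and the constant `eps` are all assumptions made for illustration, and this is not the authors' implementation.

```python
import math


def potential_distances(points, sigma=1.0, t=3, eps=1e-7):
    """Pairwise distances between log-transformed t-step diffusion probabilities.

    Hypothetical helper for illustration only; sigma, t and eps are assumed
    defaults, not values taken from the paper.
    """
    n = len(points)

    # Gaussian affinity between every pair of points (fixed bandwidth, an assumption).
    K = [[math.exp(-math.dist(p, q) ** 2 / (2 * sigma ** 2)) for q in points]
         for p in points]

    # Row-normalize affinities into a Markov transition matrix.
    P = []
    for row in K:
        total = sum(row)
        P.append([k / total for k in row])

    # Raise the transition matrix to the t-th power (t-step random walk).
    Pt = P
    for _ in range(t - 1):
        Pt = [[sum(Pt[i][k] * P[k][j] for k in range(n)) for j in range(n)]
              for i in range(n)]

    # Log-transform the diffusion probabilities; eps avoids log(0).
    U = [[-math.log(v + eps) for v in row] for row in Pt]

    # Distance between two points = Euclidean distance between their rows of U.
    return [[math.dist(U[i], U[j]) for j in range(n)] for i in range(n)]


# Two well-separated pairs of points.
demo_points = [(0.0, 0.0), (0.1, 0.0), (5.0, 5.0), (5.1, 5.0)]
demo_D = potential_distances(demo_points)
print(demo_D[0][1] < demo_D[0][2])
```

In this toy example the distance within a pair comes out much smaller than the distance across pairs, which is the kind of separation a low-dimensional embedding of such distances would then aim to preserve.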

08/03/2021

Visualizing Data using GTSNE

We present a new method GTSNE to visualize high-dimensional data points ...
10/15/2021

SGEN: Single-cell Sequencing Graph Self-supervised Embedding Network

Single-cell sequencing has a significant role to explore biological proc...
07/05/2021

An Analytical Survey on Recent Trends in High Dimensional Data Visualization

Data visualization is the process by which data of any size or dimension...
11/24/2020

Structure learning for zero-inflated counts, with an application to single-cell RNA sequencing data

The problem of estimating the structure of a graph from observed data is...
11/30/2021

Towards a comprehensive visualization of structure in data

Dimensional data reduction methods are fundamental to explore and visual...
11/22/2018

Inference of the three-dimensional chromatin structure and its temporal behavior

Understanding the three-dimensional (3D) structure of the genome is esse...