Analyzing scRNA-seq data by CCP-assisted UMAP and t-SNE

06/23/2023
by   Yuta Hozumi, et al.
0

Single-cell RNA sequencing (scRNA-seq) is widely used to reveal heterogeneity in cells, which has given us insights into cell-cell communication, cell differentiation, and differential gene expression. However, analyzing scRNA-seq data is a challenge due to sparsity and the large number of genes involved. Therefore, dimensionality reduction and feature selection are important for removing spurious signals and enhancing downstream analysis. Correlated clustering and projection (CCP) was recently introduced as an effective method for preprocessing scRNA-seq data. CCP utilizes gene-gene correlations to partition the genes and, based on the partition, employs cell-cell interactions to obtain super-genes. Because CCP is a data-domain approach that does not require matrix diagonalization, it can be used in many downstream machine learning tasks. In this work, we utilize CCP as an initialization tool for uniform manifold approximation and projection (UMAP) and t-distributed stochastic neighbor embedding (t-SNE). By using eight publicly available datasets, we have found that CCP significantly improves UMAP and t-SNE visualization and dramatically improve their accuracy.

READ FULL TEXT

page 7

page 10

page 11

page 12

page 15

research
07/14/2023

Single-cell RNA-seq data imputation using Feature Propagation

While single-cell RNA sequencing provides an understanding of the transc...
research
10/15/2021

SGEN: Single-cell Sequencing Graph Self-supervised Embedding Network

Single-cell sequencing has a significant role to explore biological proc...
research
01/26/2023

SparCA: Sparse Compressed Agglomeration for Feature Extraction and Dimensionality Reduction

The most effective dimensionality reduction procedures produce interpret...
research
10/25/2022

A single-cell gene expression language model

Gene regulation is a dynamic process that connects genotype and phenotyp...
research
06/15/2021

Active feature selection discovers minimal gene-sets for classifying cell-types and disease states in single-cell mRNA-seq data

Sequencing costs currently prohibit the application of single cell mRNA-...
research
09/17/2020

Identification of Biomarkers Controlling Cell Fate In Blood Cell Development

A blood cell lineage consists of several consecutive developmental stage...
research
07/05/2020

Handling high correlations in the feature gene selection using Single-Cell RNA sequencing data

Motivation: Selecting feature genes and predicting cells' phenotype are ...

Please sign up or login with your details

Forgot password? Click here to reset