Contrastive Cycle Adversarial Autoencoders for Single-cell Multi-omics Alignment and Integration

12/05/2021
by   Xuesong Wang, et al.
0

Muilti-modality data are ubiquitous in biology, especially that we have entered the multi-omics era, when we can measure the same biological object (cell) from different aspects (omics) to provide a more comprehensive insight into the cellular system. When dealing with such multi-omics data, the first step is to determine the correspondence among different modalities. In other words, we should match data from different spaces corresponding to the same object. This problem is particularly challenging in the single-cell multi-omics scenario because such data are very sparse with extremely high dimensions. Secondly, matched single-cell multi-omics data are rare and hard to collect. Furthermore, due to the limitations of the experimental environment, the data are usually highly noisy. To promote the single-cell multi-omics research, we overcome the above challenges, proposing a novel framework to align and integrate single-cell RNA-seq data and single-cell ATAC-seq data. Our approach can efficiently map the above data with high sparsity and noise from different spaces to a low-dimensional manifold in a unified space, making the downstream alignment and integration straightforward. Compared with the other state-of-the-art methods, our method performs better in both simulated and real single-cell data. The proposed method is helpful for the single-cell multi-omics research. The improvement for integration on the simulated data is significant.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/03/2023

Is your data alignable? Principled and interpretable alignability testing and integration of single-cell data

Single-cell data integration can provide a comprehensive molecular view ...
research
03/03/2022

Graph Neural Networks for Multimodal Single-Cell Data Integration

Recent advances in multimodal single-cell technologies have enabled simu...
research
05/31/2022

AVIDA: Alternating method for Visualizing and Integrating Data

High-dimensional multimodal data arises in many scientific fields. The i...
research
05/19/2022

scICML: Information-theoretic Co-clustering-based Multi-view Learning for the Integrative Analysis of Single-cell Multi-omics data

Modern high-throughput sequencing technologies have enabled us to profil...
research
01/12/2016

Robust Lineage Reconstruction from High-Dimensional Single-Cell Data

Single-cell gene expression data provide invaluable resources for system...
research
06/15/2023

Multi-omics Prediction from High-content Cellular Imaging with Deep Learning

High-content cellular imaging, transcriptomics, and proteomics data prov...
research
10/02/2022

Towards Learned Simulators for Cell Migration

Simulators driven by deep learning are gaining popularity as a tool for ...

Please sign up or login with your details

Forgot password? Click here to reset