scICML: Information-theoretic Co-clustering-based Multi-view Learning for the Integrative Analysis of Single-cell Multi-omics data

05/19/2022
by   Pengcheng Zeng, et al.
0

Modern high-throughput sequencing technologies have enabled us to profile multiple molecular modalities from the same single cell, providing unprecedented opportunities to assay celluar heterogeneity from multiple biological layers. However, the datasets generated from these technologies tend to have high level of noise and are highly sparse, bringing challenges to data analysis. In this paper, we develop a novel information-theoretic co-clustering-based multi-view learning (scICML) method for multi-omics single-cell data integration. scICML utilizes co-clusterings to aggregate similar features for each view of data and uncover the common clustering pattern for cells. In addition, scICML automatically matches the clusters of the linked features across different data types for considering the biological dependency structure across different types of genomic features. Our experiments on four real-world datasets demonstrate that scICML improves the overall clustering performance and provides biological insights into the data analysis of peripheral blood mononuclear cells.

READ FULL TEXT
research
11/25/2020

Consistency-aware and Inconsistency-aware Graph-based Multi-view Clustering

Multi-view data analysis has gained increasing popularity because multi-...
research
12/18/2019

Cluster Analysis of High-Dimensional scRNA Sequencing Data

With ongoing developments and innovations in single-cell RNA sequencing ...
research
05/25/2019

Multi-view Information-theoretic Co-clustering for Co-occurrence Data

Multi-view clustering has received much attention recently. Most of the ...
research
10/12/2020

BayReL: Bayesian Relational Learning for Multi-omics Data Integration

High-throughput molecular profiling technologies have produced high-dime...
research
12/05/2021

Contrastive Cycle Adversarial Autoencoders for Single-cell Multi-omics Alignment and Integration

Muilti-modality data are ubiquitous in biology, especially that we have ...
research
05/19/2022

Confident Clustering via PCA Compression Ratio and Its Application to Single-cell RNA-seq Analysis

Unsupervised clustering algorithms for vectors has been widely used in t...
research
03/04/2023

Stochastic networks theory to model single-cell genomic count data

We propose a novel way of representing and analysing single-cell genomic...

Please sign up or login with your details

Forgot password? Click here to reset