Robust Integrative Biclustering for Multi-view Data

11/11/2021
by W. Zhang, et al.

In many biomedical studies, multiple views of data (e.g., genomics, proteomics) are available, and a question of particular interest is the detection of sample subgroups characterized by specific groups of variables. Biclustering methods are well suited to this problem because they assume that specific groups of variables may be relevant only to specific groups of samples. Many biclustering methods exist for identifying row-column clusters in a single view, but few exist for data from multiple views. The few existing multi-view algorithms depend heavily on regularization parameters to obtain row-column clusters, which places an unnecessary burden on users and limits their use in practice. We extend an existing biclustering method for single-view data, based on sparse singular value decomposition, to data from multiple views. Our method, integrative sparse singular value decomposition (iSSVD), incorporates stability selection to control Type I error rates, estimates the probability that each sample and variable belongs to a bicluster, finds stable biclusters, and yields interpretable row-column associations. Simulations and real data analyses show that iSSVD outperforms several other single- and multi-view biclustering methods and is able to detect meaningful biclusters. iSSVD is a user-friendly, computationally efficient algorithm that will be useful in many disease subtyping applications.
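
To make the general idea concrete, the following is a minimal sketch of biclustering with a sparse rank-one SVD combined with subsampling-based selection frequencies. It is not the authors' iSSVD implementation. It assumes that all views share the same samples (rows), that views can simply be concatenated column-wise, and that a relative soft-threshold is an acceptable stand-in for the penalty. The function names (`soft_threshold`, `sparse_rank_one`, `stable_bicluster`) and all parameter values are hypothetical.

```python
import numpy as np

def soft_threshold(z, frac):
    """Soft-threshold z at frac * max|z| (a hypothetical, relative penalty level)."""
    lam = frac * np.max(np.abs(z))
    return np.sign(z) * np.maximum(np.abs(z) - lam, 0.0)

def sparse_rank_one(X, frac_u=0.5, frac_v=0.5, n_iter=30):
    """Rank-one sparse SVD via alternating soft-thresholded power iterations."""
    # Start from the leading left singular vector of X.
    u = np.linalg.svd(X, full_matrices=False)[0][:, 0]
    v = np.zeros(X.shape[1])
    for _ in range(n_iter):
        v = soft_threshold(X.T @ u, frac_v)   # sparse loadings over variables
        v /= np.linalg.norm(v)
        u = soft_threshold(X @ v, frac_u)     # sparse loadings over samples
        u /= np.linalg.norm(u)
    return u, v

def stable_bicluster(views, n_subsamples=50, pi_thr=0.8, seed=0):
    """Selection frequencies of samples and variables over random row subsamples."""
    rng = np.random.default_rng(seed)
    X = np.hstack(views)   # assumption: views share rows and are simply concatenated
    n, p = X.shape
    row_hits, row_seen, col_hits = np.zeros(n), np.zeros(n), np.zeros(p)
    for _ in range(n_subsamples):
        idx = rng.choice(n, size=n // 2, replace=False)  # half of the samples
        u, v = sparse_rank_one(X[idx])
        row_seen[idx] += 1
        row_hits[idx] += (u != 0)
        col_hits += (v != 0)
    # A row's frequency is computed only over the subsamples that contained it.
    row_freq = row_hits / np.maximum(row_seen, 1)
    col_freq = col_hits / n_subsamples
    return row_freq >= pi_thr, col_freq >= pi_thr, row_freq, col_freq

# Toy usage: two views, with one bicluster planted in the first 10 samples
# and the first 5 variables of each view.
rng = np.random.default_rng(1)
view1 = rng.normal(size=(40, 30))
view2 = rng.normal(size=(40, 20))
view1[:10, :5] += 5.0
view2[:10, :5] += 5.0
rows, cols, row_freq, col_freq = stable_bicluster([view1, view2])
print("selected samples:", np.flatnonzero(rows))
print("selected variables (concatenated index):", np.flatnonzero(cols))
```

The selection frequencies `row_freq` and `col_freq` play the role of rough membership probabilities here. Keeping only entries whose frequency reaches `pi_thr` is one simple way to favor biclusters that are stable across subsamples.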

research
08/13/2023

Weighted Sparse Partial Least Squares for Joint Sample and Feature Selection

Sparse Partial Least Squares (sPLS) is a common dimensionality reduction...
research
05/07/2021

Double-matched matrix decomposition for multi-view data

We consider the problem of extracting joint and individual signals from ...
research
12/16/2019

Latent Complete Row Space Recovery for Multi-view Subspace Clustering

Multi-view subspace clustering has been applied to applications such as ...
research
10/23/2017

SMSSVD - SubMatrix Selection Singular Value Decomposition

High throughput biomedical measurements normally capture multiple overla...
research
08/20/2013

Flexible Low-Rank Statistical Modeling with Side Information

We propose a general framework for reduced-rank modeling of matrix-value...
research
11/06/2018

Stacked Penalized Logistic Regression for Selecting Views in Multi-View Learning

In multi-view learning, features are organized into multiple sets called...
research
09/25/2020

ScreeNOT: Exact MSE-Optimal Singular Value Thresholding in Correlated Noise

We derive a formula for optimal hard thresholding of the singular value ...