Structural Learning and Integrative Decomposition of Multi-View Data

07/20/2017
by Irina Gaynanova, et al.

The increased availability of multi-view data (data on the same samples from multiple sources) has led to strong interest in models based on low-rank matrix factorizations. These models represent each data view via shared and individual components, and have been successfully applied to exploratory dimension reduction, association analysis between the views, and further learning tasks such as consensus clustering. Despite these advances, significant challenges remain in modeling partially-shared components and in identifying the number of components of each type (shared/partially-shared/individual). In this work, we formulate a novel linked component model that directly incorporates partially-shared structures. We call this model SLIDE, for Structural Learning and Integrative DEcomposition of multi-view data. We prove the existence of the SLIDE decomposition and explicitly characterize its identifiability conditions. The proposed model fitting and selection techniques allow for joint identification of the number of components of each type, in contrast to existing sequential approaches. In our empirical studies, SLIDE demonstrates excellent performance in both signal estimation and component selection. We further illustrate the methodology on breast cancer data from The Cancer Genome Atlas repository.
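To make the three component types concrete, the following is a minimal illustrative sketch, not the authors' implementation. It assumes that each view is a low-rank product of common sample scores and view-specific loadings, and that a binary pattern records which view uses which component. The names `simulate_views`, `component_type`, and the `structure` array are hypothetical and introduced only for this example.

```python
import numpy as np


def component_type(column):
    """Classify one component from its 0/1 usage pattern across views."""
    used, total = int(np.sum(column)), len(column)
    if used == total:
        return "shared"
    if used == 1:
        return "individual"
    return "partially-shared"


def simulate_views(n, dims, structure, seed=0):
    """Simulate noiseless views from a binary (views x components) pattern.

    Assumption for illustration: view k equals scores @ loadings_k.T, where
    the columns of loadings_k are zero for components the view does not use.
    """
    rng = np.random.default_rng(seed)
    structure = np.asarray(structure)
    n_components = structure.shape[1]
    scores = rng.standard_normal((n, n_components))  # one row per sample
    views = []
    for d, pattern in zip(dims, structure):
        # Broadcasting zeroes out the columns of unused components.
        loadings = rng.standard_normal((d, n_components)) * pattern
        views.append(scores @ loadings.T)
    return scores, views


# Three views, three components: rows are views, columns are components.
structure = np.array([
    [1, 1, 0],
    [1, 1, 0],
    [1, 0, 1],
])
types = [component_type(structure[:, j]) for j in range(structure.shape[1])]
scores, views = simulate_views(n=50, dims=(6, 5, 4), structure=structure)
```

Here the first component is used by all views, the second by the first two views only, and the third by the last view only. Choosing how many components of each type to include corresponds, in this toy encoding, to choosing the columns of `structure`.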


