
Data Integration Via Analysis of Subspaces (DIVAS)

by Jack Prothero, et al.

Modern data collection in many fields, including bioinformatics, often incorporates multiple traits derived from different data types (i.e., platforms). Such data are called multi-block, multi-view, or multi-omics data. The emergent field of data integration develops and applies new methods for studying multi-block data and identifying how different data types relate and differ. One major frontier in contemporary data integration research is methodology that can identify partially-shared structure, that is, structure shared among some sub-collections of data types but not all of them. This work presents a new approach: Data Integration Via Analysis of Subspaces (DIVAS). DIVAS combines new insights in angular subspace perturbation theory with recent developments in matrix signal processing and convex-concave optimization into one algorithm for exploring partially-shared structure. Because it is based on principal angles between subspaces, DIVAS provides built-in inference on the results of the analysis, and it is effective even in high-dimension-low-sample-size (HDLSS) situations.
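
Since the abstract rests on principal angles between subspaces, a small illustration of that quantity may help. The sketch below is not the DIVAS algorithm; it only shows the standard way to compute principal angles (orthonormalize each basis, then take the singular values of the cross-product matrix). The function name `principal_angles` and the toy matrices are assumptions made for illustration, and both inputs are assumed to have full column rank.

```python
import numpy as np

def principal_angles(A, B):
    """Principal angles (radians, ascending) between the column spaces of A and B.

    Illustrative helper, not part of DIVAS. Assumes A and B have the same
    number of rows and full column rank.
    """
    # Orthonormal bases for the two column spaces (reduced QR).
    Qa, _ = np.linalg.qr(A)
    Qb, _ = np.linalg.qr(B)
    # Singular values of Qa^T Qb are the cosines of the principal angles.
    cosines = np.linalg.svd(Qa.T @ Qb, compute_uv=False)
    # Guard against round-off pushing values slightly outside [-1, 1].
    cosines = np.clip(cosines, -1.0, 1.0)
    return np.arccos(cosines)

# Toy example in R^3: span{e1, e2} versus span{e1, e3}.
A = np.array([[1.0, 0.0],
              [0.0, 1.0],
              [0.0, 0.0]])
B = np.array([[1.0, 0.0],
              [0.0, 0.0],
              [0.0, 1.0]])
angles_deg = np.degrees(principal_angles(A, B))
print(angles_deg)
```

In this toy case the two planes share the direction e1, so the smallest angle is essentially zero, while the remaining directions are orthogonal, giving an angle of about 90 degrees. A near-zero principal angle is the kind of evidence that points to a shared direction between two blocks.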



