Data Integration Via Analysis of Subspaces (DIVAS)

12/01/2022
by Jack Prothero, et al.

Modern data collection in many fields, including bioinformatics, often incorporates multiple traits derived from different data types (i.e., platforms). We call such data multi-block, multi-view, or multi-omics data. The emergent field of data integration develops and applies new methods for studying multi-block data and identifying how different data types relate and differ. One major frontier in contemporary data integration research is methodology that can identify partially shared structure between sub-collections of data types. This work presents a new approach: Data Integration Via Analysis of Subspaces (DIVAS). DIVAS combines new insights in angular subspace perturbation theory with recent developments in matrix signal processing and convex-concave optimization into one algorithm for exploring partially shared structure. Based on principal angles between subspaces, DIVAS provides built-in inference on the results of the analysis and is effective even in high-dimension-low-sample-size (HDLSS) situations.
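Since the abstract leans on principal angles between subspaces, a small illustration of that quantity may help. The sketch below is not the DIVAS algorithm and is not taken from the paper; it only shows the standard way to compute principal angles from orthonormal bases, using NumPy. The function name `principal_angles` and the toy matrices `A` and `B` are hypothetical, chosen for illustration, and the inputs are assumed to have full column rank.

```python
import numpy as np

def principal_angles(A, B):
    """Principal angles (radians, ascending) between the column spaces of A and B.

    Illustrative helper only; assumes A and B have full column rank.
    """
    # Orthonormal bases for each column space via reduced QR.
    Qa, _ = np.linalg.qr(A)
    Qb, _ = np.linalg.qr(B)
    # Singular values of Qa^T Qb are the cosines of the principal angles.
    cosines = np.linalg.svd(Qa.T @ Qb, compute_uv=False)
    # Clip to guard against round-off pushing values outside [-1, 1].
    return np.arccos(np.clip(cosines, -1.0, 1.0))

# Hypothetical toy example: two 2-D subspaces of R^3 that share one direction.
A = np.array([[1.0, 0.0], [0.0, 1.0], [0.0, 0.0]])  # span{e1, e2}
B = np.array([[1.0, 0.0], [0.0, 0.0], [0.0, 1.0]])  # span{e1, e3}
angles_deg = np.degrees(principal_angles(A, B))
print(angles_deg)
```

In this toy case the two subspaces have one direction in common and one direction orthogonal, so the smallest principal angle is near zero and the largest is near 90 degrees. A small angle between subspaces drawn from different data blocks is the kind of evidence one would read as shared structure, although how DIVAS turns angles into inference is described in the full paper, not here.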


