Feature Selection for Data Integration with Mixed Multi-view Data

03/27/2019
by   Yulia Baker, et al.
0

Data integration methods that analyze multiple sources of data simultaneously can often provide more holistic insights than can separate inquiries of each data source. Motivated by the advantages of data integration in the era of "big data", we investigate feature selection for high-dimensional multi-view data with mixed data types (e.g. continuous, binary, count-valued). This heterogeneity of multi-view data poses numerous challenges for existing feature selection methods. However, after critically examining these issues through empirical and theoretically-guided lenses, we develop a practical solution, the Block Randomized Adaptive Iterative Lasso (B-RAIL), which combines the strengths of the randomized Lasso, adaptive weighting schemes, and stability selection. B-RAIL serves as a versatile data integration method for sparse regression and graph selection, and we demonstrate the effectiveness of B-RAIL through extensive simulations and a case study to infer the ovarian cancer gene regulatory network. In this case study, B-RAIL successfully identifies well-known biomarkers associated with ovarian cancer and hints at novel candidates for future ovarian cancer research.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/11/2019

Integrative Generalized Convex Clustering Optimization and Feature Selection for Mixed Multi-View Data

In mixed multi-view data, multiple sets of diverse features are measured...
research
05/26/2023

Multi-Objective Genetic Algorithm for Multi-View Feature Selection

Multi-view datasets offer diverse forms of data that can enhance predict...
research
04/05/2022

Incremental Unsupervised Feature Selection for Dynamic Incomplete Multi-view Data

Multi-view unsupervised feature selection has been proven to be efficien...
research
03/12/2018

Dissimilarity-based representation for radiomics applications

Radiomics is a term which refers to the analysis of the large amount of ...
research
10/30/2020

View selection in multi-view stacking: Choosing the meta-learner

Multi-view stacking is a framework for combining information from differ...
research
04/25/2019

Adaptive Collaborative Similarity Learning for Unsupervised Multi-view Feature Selection

In this paper, we investigate the research problem of unsupervised multi...
research
12/01/2022

Data Integration Via Analysis of Subspaces (DIVAS)

Modern data collection in many data paradigms, including bioinformatics,...

Please sign up or login with your details

Forgot password? Click here to reset