Data Fusion by Matrix Factorization

07/02/2013
by Marinka Zitnik, et al.

For most problems in science and engineering, we can obtain data sets that describe the observed system from various perspectives and record the behavior of its individual components. Data fusion mines such heterogeneous data sets collectively. Fusion can focus on a specific target relation and exploit directly associated data together with contextual data and data about the system's constraints. In this paper, we describe a data fusion approach based on penalized matrix tri-factorization (DFMF) that simultaneously factorizes data matrices to reveal hidden associations. The approach can directly consider any data that can be expressed as a matrix, including data from feature-based representations, ontologies, associations, and networks. We demonstrate the utility of DFMF on a gene function prediction task with eleven different data sources and on the prediction of pharmacologic actions by fusing six data sources. Our data fusion algorithm compares favorably to alternative data integration approaches and achieves higher accuracy than can be obtained from any single data source alone.
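
To make the idea of simultaneously factorizing several data matrices more concrete, the following is a minimal sketch and not the DFMF algorithm itself. It assumes two relation matrices that share one object type: `R12` relates objects of type 1 to type 2, and `R23` relates type 2 to type 3. Each relation is approximated by a tri-factorization, `R12 ≈ G1 S12 G2ᵀ` and `R23 ≈ G2 S23 G3ᵀ`, so the shared factor `G2` is what couples the two data sources. The plain gradient descent, the L2 penalty (a simple stand-in for the paper's penalization), the synthetic data, and all function and parameter names are assumptions made for illustration.

```python
import numpy as np


def objective(R12, R23, G1, G2, G3, S12, S23, lam):
    """Squared reconstruction error of both relations plus an L2 penalty."""
    fit = (np.sum((R12 - G1 @ S12 @ G2.T) ** 2)
           + np.sum((R23 - G2 @ S23 @ G3.T) ** 2))
    penalty = lam * sum(np.sum(X ** 2) for X in (G1, G2, G3, S12, S23))
    return fit + penalty


def fit_tri_factorization(R12, R23, rank=2, lam=0.01, lr=0.005,
                          n_iter=3000, seed=0):
    """Jointly fit R12 ~ G1 S12 G2^T and R23 ~ G2 S23 G3^T.

    Hypothetical helper: plain gradient descent on the penalized
    squared error. G2 is shared by both relations.
    """
    rng = np.random.default_rng(seed)
    n1, n2 = R12.shape
    n3 = R23.shape[1]
    # One factor matrix per object type, one small matrix per relation.
    G1, G2, G3 = (rng.random((n, rank)) for n in (n1, n2, n3))
    S12, S23 = rng.random((rank, rank)), rng.random((rank, rank))

    history = [objective(R12, R23, G1, G2, G3, S12, S23, lam)]
    for _ in range(n_iter):
        # Residuals of the two reconstructions.
        E12 = G1 @ S12 @ G2.T - R12
        E23 = G2 @ S23 @ G3.T - R23
        # Gradients of the objective; G2 receives terms from both relations.
        grad_G1 = 2 * (E12 @ G2 @ S12.T + lam * G1)
        grad_G2 = 2 * (E12.T @ G1 @ S12 + E23 @ G3 @ S23.T + lam * G2)
        grad_G3 = 2 * (E23.T @ G2 @ S23 + lam * G3)
        grad_S12 = 2 * (G1.T @ E12 @ G2 + lam * S12)
        grad_S23 = 2 * (G2.T @ E23 @ G3 + lam * S23)
        # Simultaneous update of all factors.
        G1 -= lr * grad_G1
        G2 -= lr * grad_G2
        G3 -= lr * grad_G3
        S12 -= lr * grad_S12
        S23 -= lr * grad_S23
        history.append(objective(R12, R23, G1, G2, G3, S12, S23, lam))
    return (G1, G2, G3, S12, S23), history


# Synthetic low-rank relations that share the type-2 objects (illustrative data).
data_rng = np.random.default_rng(1)
A, B, C = data_rng.random((6, 2)), data_rng.random((5, 2)), data_rng.random((4, 2))
R12, R23 = A @ B.T, B @ C.T

(G1, G2, G3, S12, S23), history = fit_tri_factorization(R12, R23)
print("objective before:", history[0], "after:", history[-1])
```

In this sketch, an estimate of a relation is read off by multiplying the fitted factors back together, for example `G1 @ S12 @ G2.T` for the type 1 to type 2 relation. Because `G2` is fitted against both matrices, information in `R23` can influence the reconstruction of `R12`, which is the intuition behind fusing data sources through shared factors.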


