Heterogeneous Matrix Factorization: When Features Differ by Datasets

05/28/2023
by   Naichen Shi, et al.
0

In myriad statistical applications, data are collected from related but heterogeneous sources. These sources share some commonalities while containing idiosyncratic characteristics. More specifically, consider the setting where observation matrices from N sources {M_i}_i=1^N are generated from a few common and source-specific factors. Is it possible to recover the shared and source-specific factors? We show that under appropriate conditions on the alignment of source-specific factors, the problem is well-defined and both shared and source-specific factors are identifiable under a constrained matrix factorization objective. To solve this objective, we propose a new class of matrix factorization algorithms, called Heterogeneous Matrix Factorization. HMF is easy to implement, enjoys local linear convergence under suitable assumptions, and is intrinsically distributed. Through a variety of empirical studies, we showcase the advantageous properties of HMF and its potential application in feature extraction and change detection.

READ FULL TEXT
research
01/23/2014

Matrix factorization with Binary Components

Motivated by an application in computational biology, we consider low-ra...
research
07/17/2022

Personalized PCA: Decoupling Shared and Unique Features

In this paper, we tackle a significant challenge in PCA: heterogeneity. ...
research
07/02/2013

Data Fusion by Matrix Factorization

For most problems in science and engineering we can obtain data sets tha...
research
11/28/2018

Deep Collective Matrix Factorization for Augmented Multi-View Learning

Learning by integrating multiple heterogeneous data sources is a common ...
research
12/03/2021

ALX: Large Scale Matrix Factorization on TPUs

We present ALX, an open-source library for distributed matrix factorizat...
research
03/15/2012

A Bayesian Matrix Factorization Model for Relational Data

Relational learning can be used to augment one data source with other co...
research
10/27/2020

C-SALT: Mining Class-Specific ALTerations in Boolean Matrix Factorization

Given labeled data represented by a binary matrix, we consider the task ...

Please sign up or login with your details

Forgot password? Click here to reset