Joint NMF for Identification of Shared Features in Datasets and a Dataset Distance Measure

07/11/2022
by   Hannah Friedman, et al.
0

In this paper, we derive a new method for determining shared features of datasets by employing joint non-negative matrix factorization and analyzing the resulting factorizations. Our approach uses the joint factorization of two dataset matrices X_1,X_2 into non-negative matrices X_1 = AS_1, X_2 = AS_2 to derive a similarity measure that determines how well a shared basis for X_1, X_2 approximates each dataset. We also propose a dataset distance measure built upon this method and the learned factorization. Our method is able to successfully identity differences in structure in both image and text datasets. Potential applications include classification, detecting plagiarism or other manipulation, and learning relationships between data sets.

READ FULL TEXT

page 3

page 4

page 5

research
03/24/2021

Feature Weighted Non-negative Matrix Factorization

Non-negative Matrix Factorization (NMF) is one of the most popular techn...
research
07/12/2019

A Quantum-inspired Classical Algorithm for Separable Non-negative Matrix Factorization

Non-negative Matrix Factorization (NMF) asks to decompose a (entry-wise)...
research
05/07/2018

Semi-Orthogonal Non-Negative Matrix Factorization

Non-negative Matrix Factorization (NMF) is a popular clustering and dime...
research
08/09/2023

Multi-modal Multi-view Clustering based on Non-negative Matrix Factorization

By combining related objects, unsupervised machine learning techniques a...
research
05/14/2018

Integrating Hypertension Phenotype and Genotype with Hybrid Non-negative Matrix Factorization

Hypertension is a heterogeneous syndrome in need of improved subtyping u...
research
09/07/2019

A Non-Negative Factorization approach to node pooling in Graph Convolutional Neural Networks

The paper discusses a pooling mechanism to induce subsampling in graph s...
research
10/08/2018

Detecting Memorization in ReLU Networks

We propose a new notion of `non-linearity' of a network layer with respe...

Please sign up or login with your details

Forgot password? Click here to reset