A sampling algorithm to compute the set of feasible solutions for non-negative matrix factorization with an arbitrary rank

by Ragnhild Laursen, et al.

Non-negative Matrix Factorization (NMF) is a useful method to extract features from multivariate data, but an important and sometimes neglected concern is that NMF can result in non-unique solutions. Often, there exists a Set of Feasible Solutions (SFS), which makes it more difficult to interpret the factorization. This problem is ignored particularly often in cancer genomics, where NMF is used to infer information about the mutational processes present in the evolution of cancer. In this paper, the extent of non-uniqueness is investigated for two mutational count data sets, and a new sampling algorithm that can find the SFS is introduced. Our sampling algorithm is easy to implement and applies to an arbitrary rank of NMF. This is in contrast to the state of the art, where the NMF rank must be smaller than or equal to four. For lower ranks, we show that our algorithm performs similarly to the polygon inflation algorithm that was developed in relation to chemometrics. Furthermore, we show how the size of the SFS can have a high influence on the apparent variability of a solution. Our sampling algorithm is implemented in the R package SFS (<https://github.com/ragnhildlaursen/SFS>).
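To make the non-uniqueness concrete, the following is a minimal sketch in plain Python of two different non-negative factorizations of the same matrix. It is not the sampling algorithm from the paper or the API of the SFS package. The matrices `W`, `H`, and `A` and the helpers `matmul` and `is_nonnegative` are hypothetical, chosen only for illustration. The idea is that for an invertible matrix `A`, the pair `(W A, A^{-1} H)` reproduces the same product as `(W, H)`, and it counts as another feasible solution whenever both new factors stay non-negative.

```python
def matmul(X, Y):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]


def is_nonnegative(M):
    """Return True if every entry of M is >= 0."""
    return all(entry >= 0 for row in M for entry in row)


# Illustrative rank-2 factorization V = W H (values are assumptions).
W = [[1, 2], [3, 1], [0, 2]]
H = [[2, 3, 4], [1, 1, 1]]

# An invertible transformation and its inverse (A times A_inv is the identity).
A = [[1, 1], [0, 1]]
A_inv = [[1, -1], [0, 1]]

# Transformed factors: W_alt = W A and H_alt = A^{-1} H.
W_alt = matmul(W, A)
H_alt = matmul(A_inv, H)

V = matmul(W, H)
V_alt = matmul(W_alt, H_alt)

print("Same product:", V == V_alt)
print("Both factor pairs non-negative:",
      all(is_nonnegative(M) for M in (W, H, W_alt, H_alt)))
print("Factors differ:", W != W_alt and H != H_alt)
```

Here both factor pairs are valid NMF solutions for the same data matrix, even though the individual factors differ. The set of all such pairs is what the abstract calls the SFS.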





A Quantum-inspired Classical Algorithm for Separable Non-negative Matrix Factorization

Non-negative Matrix Factorization (NMF) asks to decompose a (entry-wise)...

Model selection for robust learning of mutational signatures using Negative Binomial non-negative matrix factorization

The spectrum of mutations in a collection of cancer genomes can be descr...

Low-Rank Reorganization via Proportional Hazards Non-negative Matrix Factorization Unveils Survival Associated Gene Clusters

One of the central goals of precision health is the understanding and in...

Identifying Population Movements with Non-Negative Matrix Factorization from Wi-Fi User Counts in Smart and Connected Cities

Non-Negative Matrix Factorization (NMF) is a valuable matrix factorizati...

Robust Large Scale Non-negative Matrix Factorization using Proximal Point Algorithm

A robust algorithm for non-negative matrix factorization (NMF) is presen...

Towards Adapting NMF Dictionaries Using Total Variability Modeling for Noise-Robust Acoustic Features

We propose an algorithm to extract noise-robust acoustic features from n...

Clustering US States by Time Series of COVID-19 New Case Counts with Non-negative Matrix Factorization

The spreading pattern of COVID-19 differ a lot across the US states unde...