Data embedding and prediction by sparse tropical matrix factorization

12/09/2020
by   Amra Omanović, et al.
16

Matrix factorization methods are linear models, with limited capability to model complex relations. In our work, we use tropical semiring to introduce non-linearity into matrix factorization models. We propose a method called Sparse Tropical Matrix Factorization (STMF) for the estimation of missing (unknown) values. We evaluate the efficiency of the STMF method on both synthetic data and biological data in the form of gene expression measurements downloaded from The Cancer Genome Atlas (TCGA) database. Tests on unique synthetic data showed that STMF approximation achieves a higher correlation than non-negative matrix factorization (NMF), which is unable to recover patterns effectively. On real data, STMF outperforms NMF on six out of nine gene expression datasets. While NMF assumes normal distribution and tends toward the mean value, STMF can better fit to extreme values and distributions. STMF is the first work that uses tropical semiring on sparse data. We show that in certain cases semirings are useful because they consider the structure, which is different and simpler to understand than it is with standard linear algebra.

READ FULL TEXT

page 19

page 21

page 23

page 25

page 27

page 29

page 31

page 33

research
05/13/2022

FastSTMF: Efficient tropical matrix factorization algorithm for sparse data

Matrix factorization, one of the most popular methods in machine learnin...
research
08/09/2020

Low-Rank Reorganization via Proportional Hazards Non-negative Matrix Factorization Unveils Survival Associated Gene Clusters

One of the central goals of precision health is the understanding and in...
research
10/20/2018

SL^2MF: Predicting Synthetic Lethality in Human Cancers via Logistic Matrix Factorization

Synthetic lethality (SL) is a promising concept for novel discovery of a...
research
02/13/2020

Disentangling Overlapping Beliefs by Structured Matrix Factorization

Much work on social media opinion polarization focuses on identifying se...
research
12/17/2018

Bayesian Mean-parameterized Nonnegative Binary Matrix Factorization

Binary data matrices can represent many types of data such as social net...
research
07/06/2022

A flexible model-based framework for robust estimation of mutational signatures

Somatic mutations in cancer can be viewed as a mixture distribution of s...
research
01/08/2014

Robust Large Scale Non-negative Matrix Factorization using Proximal Point Algorithm

A robust algorithm for non-negative matrix factorization (NMF) is presen...

Please sign up or login with your details

Forgot password? Click here to reset