Low-Rank Reorganization via Proportional Hazards Non-negative Matrix Factorization Unveils Survival Associated Gene Clusters

08/09/2020
by Zhi Huang, et al.

One of the central goals of precision health is to understand and interpret high-dimensional biological data in order to identify genes and markers associated with disease initiation, development, and outcomes. Significant effort has gone into treating gene expression data as real-valued matrices for multiple analyses while accounting for time-to-event modeling through survival times. Traditional biological analysis has treated non-negative matrix factorization (NMF) of the gene expression matrix and survival regression with the Cox proportional hazards model as separate steps. In this work, Cox proportional hazards regression is integrated with NMF by imposing survival constraints. This is accomplished by jointly optimizing the Frobenius norm and the partial log-likelihood for events such as death or relapse. Simulation results on synthetic data demonstrated the superiority of the proposed methodology over other NMF algorithms in finding survival-associated gene clusters. In addition, on breast cancer gene expression data, the proposed technique can unravel critical clusters of cancer genes. The discovered gene clusters reflect rich biological implications and can help identify survival-related biomarkers. Towards the goal of precision health and cancer treatment, the proposed algorithm can help understand and interpret high-dimensional, heterogeneous genomics data by accurately identifying survival-associated gene clusters.
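
To make the idea of jointly optimizing a Frobenius-norm term and a Cox partial log-likelihood concrete, the following is a minimal sketch of how such a combined objective could be evaluated. It is not the authors' implementation. The function names, the trade-off weight `alpha`, the layout of `X` (genes by samples) with `X ≈ W @ H`, and the use of the columns of `H` as Cox covariates are all assumptions made for illustration, and tied event times receive no special treatment.

```python
import numpy as np

def cox_partial_log_likelihood(risk, time, event):
    """Cox partial log-likelihood for per-sample risk scores.

    Assumes no tied event times (no Breslow/Efron correction).
    """
    # Sort by descending time so that a running accumulation over the
    # sorted scores covers each sample's risk set (all t_j >= t_i).
    order = np.argsort(-time)
    risk, event = risk[order], event[order]
    log_risk_set = np.logaddexp.accumulate(risk)  # log of sum of exp(risk) over the risk set
    return float(np.sum((risk - log_risk_set)[event == 1]))

def joint_objective(X, W, H, beta, time, event, alpha=1.0):
    """Hypothetical combined loss: reconstruction error minus weighted survival fit."""
    reconstruction = np.linalg.norm(X - W @ H, ord="fro") ** 2
    risk = H.T @ beta  # one risk score per sample (assumed covariates: columns of H)
    return reconstruction - alpha * cox_partial_log_likelihood(risk, time, event)

# Toy usage with random non-negative factors (illustrative data only).
rng = np.random.default_rng(0)
W = rng.random((6, 2))            # genes x rank
H = rng.random((2, 4))            # rank x samples
X = W @ H                         # exact low-rank matrix, so reconstruction error is 0
beta = np.zeros(2)                # hypothetical Cox coefficients
time = np.array([5.0, 3.0, 8.0, 1.0])
event = np.array([1, 0, 1, 1])    # 1 = event observed, 0 = censored
loss = joint_objective(X, W, H, beta, time, event, alpha=0.5)
```

Lower values of this loss correspond to a better reconstruction and a higher partial log-likelihood; in this sketch, `alpha` controls how strongly the survival term influences the factorization.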


