GIFT: Guided and Interpretable Factorization for Tensors - An Application to Large-Scale Multi-platform Cancer Analysis

01/09/2018
by   Jungwoo Lee, et al.
0

Given multi-platform genome data with prior knowledge of functional gene sets, how can we extract interpretable latent relationships between patients and genes? More specifically, how can we devise a tensor factorization method which produces an interpretable gene factor matrix based on gene set information while maintaining the decomposition quality and speed? We propose GIFT, a Guided and Interpretable Factorization for Tensors. GIFT provides interpretable factor matrices by encoding prior knowledge as a regularization term in its objective function. Experiment results demonstrate that GIFT produces interpretable factorizations with high scalability and accuracy, while other methods lack interpretability. We apply GIFT to the PanCan12 dataset, and GIFT reveals significant relations between cancers, gene sets, and genes, such as influential gene sets for specific cancer (e.g., interferon-gamma response gene set for ovarian cancer) or relations between cancers and genes (e.g., BRCA cancer - APOA1 gene and OV, UCEC cancers - BST2 gene).

READ FULL TEXT
research
12/15/2014

Bayesian multi-tensor factorization

We introduce Bayesian multi-tensor factorization, a model that is the fi...
research
05/21/2018

GSAE: an autoencoder with embedded gene-set nodes for genomics functional characterization

Bioinformatics tools have been developed to interpret gene expression da...
research
11/22/2017

SNeCT: Scalable network constrained Tucker decomposition for integrative multi-platform data analysis

Motivation: How do we integratively analyze large-scale multi-platform g...
research
05/11/2018

TensOrMachine: Probabilistic Boolean Tensor Decomposition

Boolean tensor decomposition approximates data of multi-way binary relat...
research
07/22/2022

Redundancy-aware unsupervised ranking based on game theory – application to gene enrichment analysis

Gene set collections are a common ground to study the enrichment of gene...
research
06/09/2023

Incorporating Prior Knowledge in Deep Learning Models via Pathway Activity Autoencoders

Motivation: Despite advances in the computational analysis of high-throu...
research
02/07/2020

Bidimensional linked matrix factorization for pan-omics pan-cancer analysis

Several modern applications require the integration of multiple large da...

Please sign up or login with your details

Forgot password? Click here to reset