PANTHER: Pathway Augmented Nonnegative Tensor factorization for HighER-order feature learning

12/15/2020
by   Yuan Luo, et al.
0

Genetic pathways usually encode molecular mechanisms that can inform targeted interventions. It is often challenging for existing machine learning approaches to jointly model genetic pathways (higher-order features) and variants (atomic features), and present to clinicians interpretable models. In order to build more accurate and better interpretable machine learning models for genetic medicine, we introduce Pathway Augmented Nonnegative Tensor factorization for HighER-order feature learning (PANTHER). PANTHER selects informative genetic pathways that directly encode molecular mechanisms. We apply genetically motivated constrained tensor factorization to group pathways in a way that reflects molecular mechanism interactions. We then train a softmax classifier for disease types using the identified pathway groups. We evaluated PANTHER against multiple state-of-the-art constrained tensor/matrix factorization models, as well as group guided and Bayesian hierarchical models. PANTHER outperforms all state-of-the-art comparison models significantly (p<0.05). Our experiments on large scale Next Generation Sequencing (NGS) and whole-genome genotyping datasets also demonstrated wide applicability of PANTHER. We performed feature analysis in predicting disease types, which suggested insights and benefits of the identified pathway groups.

READ FULL TEXT
research
05/28/2022

Additive Higher-Order Factorization Machines

In the age of big data and interpretable machine learning, approaches ne...
research
06/10/2023

TensorNet: Cartesian Tensor Representations for Efficient Learning of Molecular Potentials

The development of efficient machine learning models for molecular syste...
research
01/18/2021

HyperNTF: A Hypergraph Regularized Nonnegative Tensor Factorization for Dimensionality Reduction

Most methods for dimensionality reduction are based on either tensor rep...
research
11/02/2017

Efficient Constrained Tensor Factorization by Alternating Optimization with Primal-Dual Splitting

Tensor factorization with hard and/or soft constraints has played an imp...
research
05/14/2018

Integrating Hypertension Phenotype and Genotype with Hybrid Non-negative Matrix Factorization

Hypertension is a heterogeneous syndrome in need of improved subtyping u...
research
09/27/2018

Cancer classification and pathway discovery using non-negative matrix factorization

Extracting genetic information from a full range of sequencing data is i...
research
11/27/2019

Conditional Hierarchical Bayesian Tucker Decomposition

Our research focuses on studying and developing methods for reducing the...

Please sign up or login with your details

Forgot password? Click here to reset