Interpreting multi-variate models with setPCA

11/17/2021
by Nordine Aouni, et al.

Principal Component Analysis (PCA) and other multi-variate models are often used in the analysis of "omics" data. These models contain a great deal of information that is currently neither easily accessible nor easily interpretable. Here we present an algorithmic method developed to integrate this information with existing databases of background knowledge, stored in the form of known sets (for instance gene sets or pathways). To make the method accessible, we have produced a Graphical User Interface (GUI) in Matlab that overlays known-set information onto the loadings plot and thus improves the interpretability of the multi-variate model. For each known set, a search algorithm finds and displays the optimal convex hull covering a subset of that set's elements. In this paper we discuss two main topics: the details of the search algorithm for the optimal convex hull, and the GUI, which is freely available for download for academic use.
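The abstract does not state how "optimal" is defined or how the search proceeds, so the following is only an illustrative sketch under stated assumptions, not the authors' algorithm. It is written in Python rather than Matlab. It assumes each element is a 2-D loadings point, scores a hull as (set members covered) minus (non-members covered), and greedily drops one hull vertex at a time while that score improves. The names `greedy_hull` and `score`, and the scoring rule itself, are hypothetical.

```python
from typing import List, Tuple

Point = Tuple[float, float]

def cross(o: Point, a: Point, b: Point) -> float:
    """Z-component of (a - o) x (b - o); positive for a left turn."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

def convex_hull(points: List[Point]) -> List[Point]:
    """Andrew's monotone chain; returns hull vertices counter-clockwise."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts
    lower: List[Point] = []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    upper: List[Point] = []
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    return lower[:-1] + upper[:-1]

def inside(hull: List[Point], p: Point) -> bool:
    """True if p is inside or on a counter-clockwise convex polygon."""
    n = len(hull)
    if n < 3:
        return False
    return all(cross(hull[i], hull[(i + 1) % n], p) >= 0 for i in range(n))

def score(hull: List[Point], members: List[Point], others: List[Point]) -> int:
    """Assumed objective: members covered minus non-members covered."""
    covered = sum(inside(hull, p) for p in members)
    intruders = sum(inside(hull, p) for p in others)
    return covered - intruders

def greedy_hull(members: List[Point], others: List[Point],
                min_size: int = 3) -> Tuple[List[Point], int]:
    """Greedy search: remove the hull vertex that most improves the score."""
    subset = list(dict.fromkeys(members))  # de-duplicate, keep order
    hull = convex_hull(subset)
    best = score(hull, members, others)
    improved = True
    while improved and len(subset) > min_size:
        improved = False
        for v in hull:
            trial = [p for p in subset if p != v]
            trial_hull = convex_hull(trial)
            s = score(trial_hull, members, others)
            if s > best:
                best, best_subset, best_hull = s, trial, trial_hull
                improved = True
        if improved:
            subset, hull = best_subset, best_hull
    return hull, best

# Toy loadings: a tight cluster of set members plus one outlying member.
members = [(0, 0), (1, 0), (0, 1), (1, 1), (5, 5)]
others = [(2, 2), (3, 3), (-2, -2)]
hull, best = greedy_hull(members, others)
print(hull, best)
```

In this toy example the outlying member at (5, 5) pulls two non-members into the initial hull, so the greedy step drops it and keeps the compact hull around the remaining four members. A greedy search of this kind can stop at a local optimum; a different objective or search strategy would change the result.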

