Semi-Supervised Non-Parametric Bayesian Modelling of Spatial Proteomics

03/07/2019
by   Oliver M. Crook, et al.
0

Understanding sub-cellular protein localisation is an essential component to analyse context specific protein function. Recent advances in quantitative mass-spectrometry (MS) have led to high resolution mapping of thousands of proteins to sub-cellular locations within the cell. Novel modelling considerations to capture the complex nature of these data are thus necessary. We approach analysis of spatial proteomics data in a non-parametric Bayesian framework, using mixtures of Gaussian process regression models. The Gaussian process regression model accounts for correlation structure within a sub-cellular niche, with each mixture component capturing the distinct correlation structure observed within each niche. Proteins with a priori labelled locations motivate using semi-supervised learning to inform the Gaussian process hyperparameters. We moreover provide an efficient Hamiltonian-within-Gibbs sampler for our model. Furthermore, we reduce the computational burden associated with inversion of covariance matrices by exploiting the structure in the covariance matrix. A tensor decomposition of our covariance matrices allows extended Trench and Durbin algorithms to be applied it order to reduce the computational complexity of inversion and hence accelerate computation. A stand-alone R-package implementing these methods using high-performance C++ libraries is available at: https://github.com/ococrook/toeplitz

READ FULL TEXT
research
08/26/2022

Mixtures of Gaussian Process Experts with SMC^2

Gaussian processes are a key component of many flexible statistical and ...
research
05/23/2018

Trans-Gaussian Kriging in a Bayesian framework : a case study

In the context of Gaussian Process Regression or Kriging, we propose a f...
research
08/17/2021

Semi-parametric Bayesian Additive Regression Trees

We propose a new semi-parametric model based on Bayesian Additive Regres...
research
09/27/2020

Parametric UMAP: learning embeddings with deep neural networks for representation and semi-supervised learning

We propose Parametric UMAP, a parametric variation of the UMAP (Uniform ...
research
09/11/2018

Probabilistic approach to limited-data computed tomography reconstruction

We consider the problem of reconstructing the internal structure of an o...
research
03/25/2020

Highly Scalable Bayesian Geostatistical Modeling via Meshed Gaussian Processes on Partitioned Domains

We introduce a class of scalable Bayesian hierarchical models for the an...
research
06/09/2023

Validation of semi-analytical, semi-empirical covariance matrices for two-point correlation function for Early DESI data

We present an extended validation of semi-analytical, semi-empirical cov...

Please sign up or login with your details

Forgot password? Click here to reset