Toroidal diffusions and protein structure evolution

This chapter shows how toroidal diffusions serve as convenient methodological tools for modelling protein evolution in a probabilistic framework. It addresses the construction of ergodic diffusions whose stationary distributions are well-known directional distributions, and which can be regarded as toroidal analogues of the Ornstein-Uhlenbeck process. Estimating the diffusion parameters is challenging and requires tractable approximate likelihoods; among the several approaches introduced, the one based on a specific approximation to the transition density of the wrapped normal process is shown to give the best empirical performance on average. This provides the methodological building block for the Evolutionary Torus Dynamic Bayesian Network (ETDBN), a hidden Markov model for protein evolution that emits a wrapped normal process and two continuous-time Markov chains per hidden state. The chapter describes the main features of ETDBN, which allows for both "smooth" conformational changes and "catastrophic" conformational jumps, and reports several empirical benchmarks. A case study illustrates the insights that ETDBN provides into the relationship between sequence and structure evolution.
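To make the idea of an Ornstein-Uhlenbeck analogue on angles concrete, the sketch below simulates a one-dimensional circular Langevin diffusion whose stationary density is von Mises, proportional to exp(kappa * cos(theta - mu)). It is only an illustration under stated assumptions: the choice of the von Mises distribution, the restriction to a single angle rather than a torus, the Euler-Maruyama discretisation, and all function and parameter names are hypothetical and are not taken from the chapter.

```python
import math
import random


def wrap_angle(theta):
    """Map an angle to the interval [-pi, pi)."""
    return (theta + math.pi) % (2.0 * math.pi) - math.pi


def simulate_von_mises_diffusion(mu, kappa, sigma, theta0, dt, n_steps, seed=0):
    """Euler-Maruyama path of d(theta) = -(sigma^2 / 2) * kappa * sin(theta - mu) dt + sigma dW.

    The drift is (sigma^2 / 2) times the derivative of the log von Mises
    density, so the von Mises law with mean mu and concentration kappa is
    the stationary distribution of the continuous-time process.
    (Illustrative assumption: one angle only, not a full torus.)
    """
    rng = random.Random(seed)
    theta = wrap_angle(theta0)
    path = [theta]
    for _ in range(n_steps):
        drift = -0.5 * sigma ** 2 * kappa * math.sin(theta - mu)
        noise = sigma * math.sqrt(dt) * rng.gauss(0.0, 1.0)
        # Wrap after each step so the state stays on the circle.
        theta = wrap_angle(theta + drift * dt + noise)
        path.append(theta)
    return path


def circular_mean(angles):
    """Direction of the mean resultant vector of a sample of angles."""
    s = sum(math.sin(a) for a in angles)
    c = sum(math.cos(a) for a in angles)
    return math.atan2(s, c)


if __name__ == "__main__":
    path = simulate_von_mises_diffusion(
        mu=1.0, kappa=4.0, sigma=1.0, theta0=-2.0, dt=0.01, n_steps=20000
    )
    # Discard an initial stretch so the start point has little influence.
    print("circular mean after burn-in:", circular_mean(path[2000:]))
```

For a long enough path, the circular mean of the simulated angles should settle near `mu`, which mirrors the mean-reverting behaviour of the Ornstein-Uhlenbeck process on the real line.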


