Understanding the Sources of Error in MBAR through Asymptotic Analysis

03/02/2022
by   Xiang Sherry Li, et al.
0

Multiple sampling strategies commonly used in molecular dynamics, such as umbrella sampling and alchemical free energy methods, involve sampling from multiple thermodynamic states. Commonly, the data are then recombined to construct estimates of free energies and ensemble averages using the Multistate Bennett Acceptance Ratio (MBAR) formalism. However, the error of the MBAR estimator is not well-understood: previous error analysis of MBAR assumed independent samples and did not permit attributing contributions to the total error to individual thermodynamic states. In this work, we derive a novel central limit theorem for MBAR estimates. This central limit theorem yields an error estimator which can be decomposed into contributions from the individual Markov chains used to sample the states. We demonstrate the error estimator for an umbrella sampling calculation of the alanine dipeptide in two dimensions and an alchemical calculation of the hydration free energy of methane. In both cases, the states' individual contributions to the error provide insight into the sources of error of the simulations. Our numerical results demonstrate that the time required for the Markov chain to decorrelate in individual thermodynamic states contributes considerably to the total MBAR error. Moreover, they indicate that it may be possible to use the contributions to tune the sampling and improve the accuracy of MBAR calculations.

READ FULL TEXT

page 10

page 11

research
05/18/2018

On a Metropolis-Hastings importance sampling estimator

A classical approach for approximating expectations of functions w.r.t. ...
research
05/18/2018

Markov Chain Importance Sampling - a highly efficient estimator for MCMC

Markov chain algorithms are ubiquitous in machine learning and statistic...
research
10/05/2022

Functional Central Limit Theorem and Strong Law of Large Numbers for Stochastic Gradient Langevin Dynamics

We study the mixing properties of an important optimization algorithm of...
research
02/03/2021

Missing Mass of Rank-2 Markov Chains

Estimation of missing mass with the popular Good-Turing (GT) estimator i...
research
06/16/2021

Central limit theorem for kernel estimator of invariant density in bifurcating Markov chains models

Bifurcating Markov chains (BMC) are Markov chains indexed by a full bina...
research
09/16/2019

Computing spatially resolved rotational hydration entropies from atomistic simulations

For a first principles understanding of macromolecular processes, a quan...
research
12/08/2020

Central limit theorem for bifurcating Markov chains

Bifurcating Markov chains (BMC) are Markov chains indexed by a full bina...

Please sign up or login with your details

Forgot password? Click here to reset