Data-driven Uncertainty Quantification for Systematic Coarse-grained Models

07/01/2020
by Tangxin Jin, et al.

In this work, we present methodologies for quantifying confidence in bottom-up coarse-grained models of molecular and macromolecular systems. Coarse-graining methods have been used extensively over the past decades to extend the length and time scales accessible to simulation. However, the quantification of the errors induced by the limited availability of fine-grained data is not yet established. Here, we employ rigorous statistical methods to derive guarantees for the optimal coarse-grained models obtained by approximating the multi-body potential of mean force with the relative entropy minimization, relative entropy rate minimization, and force matching methods. Specifically, we present and apply statistical approaches, such as the bootstrap and the jackknife, to infer confidence sets from a limited number of samples, i.e., molecular configurations. Moreover, we estimate asymptotic confidence intervals under the assumption that the phase space is adequately sampled. We demonstrate the need for non-asymptotic methods and quantify confidence sets through two applications. The first is a two-scale fast/slow diffusion process projected onto the slow process. With this benchmark example, we establish the methodology for both independent and time-series data. In the second, we apply these uncertainty quantification approaches to a bulk polymer system. We consider an atomistic polyethylene melt as the prototype system for developing coarse-graining tools for macromolecular systems. For this system, we estimate the coarse-grained force field and present confidence levels as a function of the amount of available microscopic data.
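
To make the contrast between resampling-based and asymptotic intervals concrete, the following is a minimal sketch, not the authors' implementation. It assumes independent samples and uses the sample mean of synthetic Gaussian data as a stand-in for a coarse-grained parameter estimator; the function names (`bootstrap_ci`, `jackknife_se`, `asymptotic_mean_ci`) and the toy data are hypothetical and do not come from the paper.

```python
import math
import random
import statistics
from typing import Callable, List, Sequence, Tuple

# Hypothetical type alias: any function mapping a sample to a scalar estimate.
Estimator = Callable[[Sequence[float]], float]


def bootstrap_ci(samples: Sequence[float], estimator: Estimator,
                 n_resamples: int = 2000, alpha: float = 0.05,
                 seed: int = 0) -> Tuple[float, float]:
    """Percentile bootstrap interval; assumes independent samples."""
    rng = random.Random(seed)
    n = len(samples)
    # Re-estimate on n_resamples datasets drawn with replacement.
    estimates = sorted(
        estimator([samples[rng.randrange(n)] for _ in range(n)])
        for _ in range(n_resamples)
    )
    lo_idx = math.floor((alpha / 2) * (n_resamples - 1))
    hi_idx = math.ceil((1 - alpha / 2) * (n_resamples - 1))
    return estimates[lo_idx], estimates[hi_idx]


def jackknife_se(samples: Sequence[float], estimator: Estimator) -> float:
    """Jackknife standard error from leave-one-out estimates."""
    n = len(samples)
    loo: List[float] = [
        estimator(list(samples[:i]) + list(samples[i + 1:])) for i in range(n)
    ]
    mean_loo = sum(loo) / n
    return math.sqrt((n - 1) / n * sum((t - mean_loo) ** 2 for t in loo))


def asymptotic_mean_ci(samples: Sequence[float],
                       alpha: float = 0.05) -> Tuple[float, float]:
    """Normal-approximation interval for the mean (large-sample limit)."""
    n = len(samples)
    z = statistics.NormalDist().inv_cdf(1 - alpha / 2)
    half_width = z * statistics.stdev(samples) / math.sqrt(n)
    center = statistics.fmean(samples)
    return center - half_width, center + half_width


# Synthetic stand-in for a small set of per-configuration estimates.
data_rng = random.Random(42)
data = [data_rng.gauss(1.0, 0.5) for _ in range(30)]

boot_lo, boot_hi = bootstrap_ci(data, statistics.fmean)
se = jackknife_se(data, statistics.fmean)
asym_lo, asym_hi = asymptotic_mean_ci(data)

print("bootstrap 95% interval:", (boot_lo, boot_hi))
print("jackknife standard error:", se)
print("asymptotic 95% interval:", (asym_lo, asym_hi))
```

Any scalar estimator can be passed in place of `statistics.fmean`. For time-series data, resampling individual points as done here ignores correlations between successive samples, so a block-based variant would be needed instead.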

