Decision functions from supervised machine learning algorithms as collective variables for accelerating molecular simulations

02/28/2018
by   Mohammad M. Sultan, et al.
0

Selection of appropriate collective variables for enhancing molecular simulations remains an unsolved problem in computational biophysics. In particular, picking initial collective variables (CVs) is particularly challenging in higher dimensions. Which atomic coordinates or transforms there of from a list of thousands should one pick for enhanced sampling runs? How does a modeler even begin to pick starting coordinates for investigation? This remains true even in the case of simple two state systems and only increases in difficulty for multi-state systems. In this work, we attempt to solve the initial CV problem using a data-driven approach inspired by supervised machine learning literature. In particular, we show how the decision functions in supervised machine learning (SML) algorithms can be used as initial CVs for accelerated sampling. Using solvated alanine dipeptide and Chignolin mini-protein as our test cases, we illustrate how the distance to the Support Vector Machines decision hyperplane, the output probability estimates from Logistic Regression, and other classifiers may be used to reversibly sample slow structural transitions. We discuss the utility of other SML algorithms that might be useful for identifying CVs for accelerating molecular simulations.

READ FULL TEXT

page 6

page 7

research
02/28/2018

Automated design of collective variables using supervised machine learning

Selection of appropriate collective variables for enhancing sampling of ...
research
02/09/2019

Nonlinear Discovery of Slow Molecular Modes using Hierarchical Dynamics Encoders

The success of enhanced sampling molecular simulations that accelerate a...
research
12/30/2017

Molecular enhanced sampling with autoencoders: On-the-fly collective variable discovery and accelerated free energy landscape exploration

Macromolecular and biomolecular folding landscapes typically contain hig...
research
12/06/2021

Collective variable discovery in the age of machine learning: reality, hype and everything in between

Understanding kinetics and thermodynamics profile of biomolecules is nec...
research
10/20/2016

Variational Koopman models: slow collective variables and molecular kinetics from short off-equilibrium simulations

Markov state models (MSMs) and Master equation models are popular approa...
research
05/19/2017

Data-driven Optimal Transport Cost Selection for Distributionally Robust Optimizatio

Recently, (Blanchet, Kang, and Murhy 2016) showed that several machine l...
research
01/22/2019

A microscopic description of acid-base equilibrium

Acid-base reactions are ubiquitous in nature. Understanding their mechan...

Please sign up or login with your details

Forgot password? Click here to reset