TorsionNet: A Reinforcement Learning Approach to Sequential Conformer Search

06/12/2020
by Tarun Gogineni, et al.

Molecular geometry prediction of flexible molecules, or conformer search, is a long-standing challenge in computational chemistry. This task is of great importance for predicting structure-activity relationships for a wide variety of substances, ranging from biomolecules to ubiquitous materials. Substantial computational resources are invested in Monte Carlo and Molecular Dynamics methods to generate diverse and representative conformer sets for medium to large molecules, which remain intractable for chemoinformatic conformer search methods. We present TorsionNet, an efficient sequential conformer search technique based on reinforcement learning under the rigid rotor approximation. The model is trained via curriculum learning, whose theoretical benefit is explored in detail, to maximize a novel, thermodynamically grounded metric called the Gibbs Score. Our experimental results show that TorsionNet outperforms the highest-scoring chemoinformatics method by 4x on large branched alkanes, and by several orders of magnitude on the previously unexplored biopolymer lignin, which has applications in renewable energy.
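
The abstract names the Gibbs Score but does not define it here. As a hedged illustration of the kind of thermodynamic weighting such a metric could build on, the sketch below computes Boltzmann populations for a set of conformer energies. It is not the paper's definition: the function names, the kcal/mol units, and the default temperature are all assumptions made for this example.

```python
import math

# Molar gas constant in kcal/(mol*K); the unit choice is an assumption of this sketch.
R_KCAL_PER_MOL_K = 0.0019872041


def boltzmann_factor_sum(energies_kcal, reference_energy, temperature_k=298.15):
    """Sum exp(-(E_i - E_ref) / RT) over a conformer set (hypothetical helper)."""
    rt = R_KCAL_PER_MOL_K * temperature_k
    return sum(math.exp(-(e - reference_energy) / rt) for e in energies_kcal)


def boltzmann_populations(energies_kcal, temperature_k=298.15):
    """Return normalized Boltzmann populations for conformer energies (hypothetical helper)."""
    if not energies_kcal:
        raise ValueError("need at least one conformer energy")
    rt = R_KCAL_PER_MOL_K * temperature_k
    e_min = min(energies_kcal)  # shift by the minimum for numerical stability
    factors = [math.exp(-(e - e_min) / rt) for e in energies_kcal]
    total = sum(factors)
    return [f / total for f in factors]


if __name__ == "__main__":
    # Illustrative relative energies in kcal/mol, not data from the paper.
    example_energies = [0.0, 0.5, 2.0]
    print(boltzmann_populations(example_energies))
```

Under this weighting, low-energy conformers dominate the sum, so a search that finds many distinct low-energy geometries scores higher than one that finds only a few or only high-energy ones.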

