Machine Learning Prediction of Accurate Atomization Energies of Organic Molecules from Low-Fidelity Quantum Chemical Calculations

by   Logan Ward, et al.

Recent studies illustrate how machine learning (ML) can be used to bypass a core challenge of molecular modeling: the tradeoff between accuracy and computational cost. Here, we assess multiple ML approaches for predicting the atomization energy of organic molecules. Our resulting models learn the difference between low-fidelity, B3LYP, and high-accuracy, G4MP2, atomization energies, and predict the G4MP2 atomization energy to 0.005 eV (mean absolute error) for molecules with less than 9 heavy atoms and 0.012 eV for a small set of molecules with between 10 and 14 heavy atoms. Our two best models, which have different accuracy/speed tradeoffs, enable the efficient prediction of G4MP2-level energies for large molecules and are available through a simple web interface.


A Universal Density Matrix Functional from Molecular Orbital-Based Machine Learning: Transferability across Organic Molecules

We address the degree to which machine learning can be used to accuratel...

Infrared spectra of neutral polycyclic aromatic hydrocarbons by machine learning

The Interest in polycyclic aromatic hydrocarbons (PAHs) spans numerous f...

Multi-fidelity prediction of molecular optical peaks with deep learning

Optical properties are central to molecular design for many applications...

A community-powered search of machine learning strategy space to find NMR property prediction models

The rise of machine learning (ML) has created an explosion in the potent...

On the equivalence of molecular graph convolution and molecular wave function with poor basis set

In this study, we demonstrate that the linear combination of atomic orbi...

BenchML: an extensible pipelining framework for benchmarking representations of materials and molecules at scale

We introduce a machine-learning (ML) framework for high-throughput bench...