Fast, accurate, and transferable many-body interatomic potentials by genetic programming

04/01/2019
by   Alberto Hernández, et al.
0

The length and time scales of atomistic simulations are limited by the computational cost of the methods used to predict material properties. In recent years there has been great progress in the use of machine learning algorithms to develop fast and accurate interatomic potential models, but it remains a challenge to develop models that generalize well and are fast enough to be used at extreme time and length scales. To address this challenge, we have developed a machine learning algorithm based on genetic programming that is capable of discovering accurate, computationally efficient many-body potential models. The key to our approach is to explore a hypothesis space of models based on fundamental physical principles and select models within this hypothesis space based on their accuracy, speed, and simplicity. The focus on simplicity reduces the risk of overfitting the training data and increases the chances of discovering a model that generalizes well. Our algorithm was validated by rediscovering an exact Lennard Jones potential and a Sutton Chen embedded atom method potential from training data generated using these models. By using training data generated from density functional theory calculations, we found potential models for elemental copper that are simple, as fast as embedded atom models, and capable of accurately predicting properties outside of their training set. Our approach requires relatively small sets of training data, making it possible to generate training data using highly accurate methods at a reasonable computational cost. We present our approach, the forms of the discovered models, and assessments of their transferability, accuracy and speed.

READ FULL TEXT
research
04/01/2019

Fast, accurate, and transferable many-body interatomic potentials by symbolic regression

The length and time scales of atomistic simulations are limited by the c...
research
10/27/2022

Generalizability of Functional Forms for Interatomic Potential Models Discovered by Symbolic Regression

In recent years there has been great progress in the use of machine lear...
research
05/18/2023

Multi-Fidelity Machine Learning for Excited State Energies of Molecules

The accurate but fast calculation of molecular excited states is still a...
research
08/24/2021

Data Aggregation for Reducing Training Data in Symbolic Regression

The growing volume of data makes the use of computationally intense mach...
research
08/05/2019

A study in Rashomon curves and volumes: A new perspective on generalization and model simplicity in machine learning

The Rashomon effect occurs when many different explanations exist for th...
research
03/22/2023

Generate labeled training data using Prompt Programming and GPT-3. An example of Big Five Personality Classification

We generated 25000 conversations labeled with Big Five Personality trait...
research
08/18/2006

Searching for Globally Optimal Functional Forms for Inter-Atomic Potentials Using Parallel Tempering and Genetic Programming

We develop a Genetic Programming-based methodology that enables discover...

Please sign up or login with your details

Forgot password? Click here to reset