Representations and Strategies for Transferable Machine Learning Models in Chemical Discovery

06/20/2021
by   Daniel R. Harper, et al.
0

Strategies for machine-learning(ML)-accelerated discovery that are general across materials composition spaces are essential, but demonstrations of ML have been primarily limited to narrow composition variations. By addressing the scarcity of data in promising regions of chemical space for challenging targets like open-shell transition-metal complexes, general representations and transferable ML models that leverage known relationships in existing data will accelerate discovery. Over a large set (ca. 1000) of isovalent transition-metal complexes, we quantify evident relationships for different properties (i.e., spin-splitting and ligand dissociation) between rows of the periodic table (i.e., 3d/4d metals and 2p/3p ligands). We demonstrate an extension to graph-based revised autocorrelation (RAC) representation (i.e., eRAC) that incorporates the effective nuclear charge alongside the nuclear charge heuristic that otherwise overestimates dissimilarity of isovalent complexes. To address the common challenge of discovery in a new space where data is limited, we introduce a transfer learning approach in which we seed models trained on a large amount of data from one row of the periodic table with a small number of data points from the additional row. We demonstrate the synergistic value of the eRACs alongside this transfer learning strategy to consistently improve model performance. Analysis of these models highlights how the approach succeeds by reordering the distances between complexes to be more consistent with the periodic table, a property we expect to be broadly useful for other materials domains.

READ FULL TEXT

page 41

page 42

research
05/06/2022

Putting Density Functional Theory to the Test in Machine-Learning-Accelerated Materials Discovery

Accelerated discovery with machine learning (ML) has begun to provide th...
research
06/04/2021

Materials Representation and Transfer Learning for Multi-Property Prediction

The adoption of machine learning in materials science has rapidly transf...
research
05/25/2021

Analogical discovery of disordered perovskite oxides by crystal structure information hidden in unsupervised material fingerprints

Compositional disorder induces myriad captivating phenomena in perovskit...
research
10/19/2022

Predicting Oxide Glass Properties with Low Complexity Neural Network and Physical and Chemical Descriptors

Due to their disordered structure, glasses present a unique challenge in...
research
11/22/2022

PhAST: Physics-Aware, Scalable, and Task-specific GNNs for Accelerated Catalyst Design

Mitigating the climate crisis requires a rapid transition towards lower ...
research
06/24/2021

Using Machine Learning and Data Mining to Leverage Community Knowledge for the Engineering of Stable Metal-Organic Frameworks

Although the tailored metal active sites and porous architectures of MOF...
research
12/23/2019

Recreation of the Periodic Table with an Unsupervised Machine Learning Algorithm

In 1869, the first draft of the periodic table was published by Russian ...

Please sign up or login with your details

Forgot password? Click here to reset