Machine learning to tame divergent density functional approximations: a new path to consensus materials design principles

06/24/2021
by   Chenru Duan, et al.
0

Computational virtual high-throughput screening (VHTS) with density functional theory (DFT) and machine-learning (ML)-acceleration is essential in rapid materials discovery. By necessity, efficient DFT-based workflows are carried out with a single density functional approximation (DFA). Nevertheless, properties evaluated with different DFAs can be expected to disagree for the cases with challenging electronic structure (e.g., open shell transition metal complexes, TMCs) for which rapid screening is most needed and accurate benchmarks are often unavailable. To quantify the effect of DFA bias, we introduce an approach to rapidly obtain property predictions from 23 representative DFAs spanning multiple families and "rungs" (e.g., semi-local to double hybrid) and basis sets on over 2,000 TMCs. Although computed properties (e.g., spin-state ordering and frontier orbital gap) naturally differ by DFA, high linear correlations persist across all DFAs. We train independent ML models for each DFA and observe convergent trends in feature importance; these features thus provide DFA-invariant, universal design rules. We devise a strategy to train ML models informed by all 23 DFAs and use them to predict properties (e.g., spin-splitting energy) of over 182k TMCs. By requiring consensus of the ANN-predicted DFA properties, we improve correspondence of these computational lead compounds with literature-mined, experimental compounds over the single-DFA approach typically employed. Both feature analysis and consensus-based ML provide efficient, alternative paths to overcome accuracy limitations of practical DFT.

READ FULL TEXT

page 6

page 9

page 13

page 16

page 22

page 41

research
09/18/2022

Low-cost machine learning approach to the prediction of transition metal phosphor excited state properties

Photoactive iridium complexes are of broad interest due to their applica...
research
11/02/2021

Audacity of huge: overcoming challenges of data scarcity and data quality for machine learning in computational materials discovery

Machine learning (ML)-accelerated discovery requires large amounts of hi...
research
11/22/2022

PhAST: Physics-Aware, Scalable, and Task-specific GNNs for Accelerated Catalyst Design

Mitigating the climate crisis requires a rapid transition towards lower ...
research
01/04/2023

Machine-Learning Prediction of the Computed Band Gaps of Double Perovskite Materials

Prediction of the electronic structure of functional materials is essent...
research
01/11/2022

Two Wrongs Can Make a Right: A Transfer Learning Approach for Chemical Discovery with Chemical Accuracy

Appropriately identifying and treating molecules and materials with sign...
research
03/02/2022

Machine learning models predict calculation outcomes with the transferability necessary for computational catalysis

Virtual high throughput screening (VHTS) and machine learning (ML) have ...

Please sign up or login with your details

Forgot password? Click here to reset