Machine-learned metrics for predicting the likelihood of success in materials discovery

11/25/2019
by   Yoolhee Kim, et al.
0

Materials discovery is often compared to the challenge of finding a needle in a haystack. While much work has focused on accurately predicting the properties of candidate materials with machine learning (ML), which amounts to evaluating whether a given candidate is a piece of straw or a needle, less attention has been paid to a critical question: Are we searching in the right haystack? We refer to the haystack as the design space for a particular materials discovery problem (i.e. the set of possible candidate materials to synthesize), and thus frame this question as one of design space selection. In this paper, we introduce two metrics, the Predicted Fraction of Improved Candidates (PFIC), and the Cumulative Maximum Likelihood of Improvement (CMLI), which we demonstrate can identify discovery-rich and discovery-poor design spaces, respectively. Using CMLI and PFIC together to identify optimal design spaces can significantly accelerate ML-driven materials discovery.

READ FULL TEXT

page 1

page 2

research
11/25/2019

Machine-learned metrics for predicting thelikelihood of success in materials discovery

Materials discovery is often compared to the challenge of finding a need...
research
05/25/2021

Analogical discovery of disordered perovskite oxides by crystal structure information hidden in unsupervised material fingerprints

Compositional disorder induces myriad captivating phenomena in perovskit...
research
08/28/2023

Matbench Discovery – An evaluation framework for machine learning crystal stability prediction

Matbench Discovery simulates the deployment of machine learning (ML) ene...
research
03/20/2018

Accelerating Materials Development via Automation, Machine Learning, and High-Performance Computing

Successful materials innovations can transform society. However, materia...
research
11/27/2014

Pattern Decomposition with Complex Combinatorial Constraints: Application to Materials Discovery

Identifying important components or factors in large amounts of noisy da...
research
04/18/2023

METAM: Goal-Oriented Data Discovery

Data is a central component of machine learning and causal inference tas...

Please sign up or login with your details

Forgot password? Click here to reset