Reliable and Explainable Machine Learning Methods for Accelerated Material Discovery

01/05/2019
by   Bhavya Kailkhura, et al.

Materials scientists are increasingly adopting machine learning (ML) to make potentially important decisions in areas such as the discovery, development, optimization, synthesis, and characterization of materials. However, despite ML's impressive performance in commercial applications, several unique challenges arise when it is applied to materials science. In this context, the contributions of this work are twofold. First, we identify common pitfalls of existing ML techniques when learning from underrepresented/imbalanced material data. Specifically, we show that with imbalanced data, standard methods for assessing the quality of ML models break down and lead to misleading conclusions. Furthermore, we find that the model's own confidence score cannot be trusted, and that model introspection methods (using simpler models) do not help because they reduce predictive performance (a reliability-explainability trade-off). Second, to overcome these challenges, we propose a general-purpose explainable and reliable machine learning framework. Specifically, we propose a novel pipeline that employs an ensemble of simpler models to reliably predict material properties. We also propose a transfer learning technique and show that the performance loss due to the models' simplicity can be overcome by exploiting correlations among different material properties. We further propose a new evaluation metric and a trust score to better quantify confidence in the predictions. To improve interpretability, we add a rationale generator component to our framework that provides both model-level and decision-level explanations. Finally, we demonstrate the versatility of our technique on two applications: 1) predicting properties of crystalline compounds, and 2) identifying novel, potentially stable solar cell materials.
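The first pitfall above, that standard quality measures can mislead on imbalanced data, can be illustrated with a small sketch. This is not the evaluation metric or pipeline proposed in this work. It is a toy classification example under stated assumptions: the labels `"stable"` and `"unstable"`, the 95/5 class split, and the helper functions `accuracy`, `per_class_recall`, and `balanced_accuracy` are all hypothetical and chosen only for illustration.

```python
from collections import Counter


def accuracy(y_true, y_pred):
    """Fraction of all samples predicted correctly."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def per_class_recall(y_true, y_pred):
    """Recall computed separately for each class present in y_true."""
    totals = Counter(y_true)
    hits = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    return {label: hits[label] / totals[label] for label in totals}


def balanced_accuracy(y_true, y_pred):
    """Unweighted mean of the per-class recalls."""
    recalls = per_class_recall(y_true, y_pred)
    return sum(recalls.values()) / len(recalls)


# Hypothetical toy data: 95 "unstable" compounds and only 5 "stable" ones.
y_true = ["unstable"] * 95 + ["stable"] * 5

# A trivial model that always predicts the majority class.
y_pred = ["unstable"] * 100

overall = accuracy(y_true, y_pred)
recalls = per_class_recall(y_true, y_pred)
balanced = balanced_accuracy(y_true, y_pred)

print(f"overall accuracy:  {overall:.2f}")
print(f"recall per class:  {recalls}")
print(f"balanced accuracy: {balanced:.2f}")
```

In this toy setting the always-majority model scores high on overall accuracy while never identifying a single rare-class sample, which is the kind of misleading conclusion the abstract warns about. Averaging recall per class is one common way to expose the problem; the metric the authors propose may differ.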

