MatSciML: A Broad, Multi-Task Benchmark for Solid-State Materials Modeling

09/12/2023
by   Kin Long Kelvin Lee, et al.
0

We propose MatSci ML, a novel benchmark for modeling MATerials SCIence using Machine Learning (MatSci ML) methods focused on solid-state materials with periodic crystal structures. Applying machine learning methods to solid-state materials is a nascent field with substantial fragmentation largely driven by the great variety of datasets used to develop machine learning models. This fragmentation makes comparing the performance and generalizability of different methods difficult, thereby hindering overall research progress in the field. Building on top of open-source datasets, including large-scale datasets like the OpenCatalyst, OQMD, NOMAD, the Carolina Materials Database, and Materials Project, the MatSci ML benchmark provides a diverse set of materials systems and properties data for model training and evaluation, including simulated energies, atomic forces, material bandgaps, as well as classification data for crystal symmetries via space groups. The diversity of properties in MatSci ML makes the implementation and evaluation of multi-task learning algorithms for solid-state materials possible, while the diversity of datasets facilitates the development of new, more generalized algorithms and methods across multiple datasets. In the multi-dataset learning setting, MatSci ML enables researchers to combine observations from multiple datasets to perform joint prediction of common properties, such as energy and forces. Using MatSci ML, we evaluate the performance of different graph neural networks and equivariant point cloud networks on several benchmark tasks spanning single task, multitask, and multi-data learning scenarios. Our open-source code is available at https://github.com/IntelLabs/matsciml.

READ FULL TEXT
research
02/16/2021

Predicting Material Properties Using a 3D Graph Neural Network with Invariant Local Descriptors

Accurately predicting material properties is critical for discovering an...
research
10/31/2022

The Open MatSci ML Toolkit: A Flexible Framework for Machine Learning in Materials Science

We present the Open MatSci ML Toolkit: a flexible, self-contained, and s...
research
03/14/2023

Recent Advances and Applications of Machine Learning in Experimental Solid Mechanics: A Review

For many decades, experimental solid mechanics has played a crucial role...
research
06/17/2022

The Open Catalyst 2022 (OC22) Dataset and Challenges for Oxide Electrocatalysis

Computational catalysis and machine learning communities have made consi...
research
07/14/2021

Generative and reproducible benchmarks for comprehensive evaluation of machine learning classifiers

Understanding the strengths and weaknesses of machine learning (ML) algo...
research
10/06/2021

Tribuo: Machine Learning with Provenance in Java

Machine Learning models are deployed across a wide range of industries, ...
research
07/07/2019

IRNet: A General Purpose Deep Residual Regression Framework for Materials Discovery

Materials discovery is crucial for making scientific advances in many do...

Please sign up or login with your details

Forgot password? Click here to reset