The Open MatSci ML Toolkit: A Flexible Framework for Machine Learning in Materials Science

10/31/2022
by   Santiago Miret, et al.
0

We present the Open MatSci ML Toolkit: a flexible, self-contained, and scalable Python-based framework to apply deep learning models and methods on scientific data with a specific focus on materials science and the OpenCatalyst Dataset. Our toolkit provides: 1. A scalable machine learning workflow for materials science leveraging PyTorch Lightning, which enables seamless scaling across different computation capabilities (laptop, server, cluster) and hardware platforms (CPU, GPU, XPU). 2. Deep Graph Library (DGL) support for rapid graph neural network prototyping and development. By publishing and sharing this toolkit with the research community via open-source release, we hope to: 1. Lower the entry barrier for new machine learning researchers and practitioners that want to get started with the OpenCatalyst dataset, which presently comprises the largest computational materials science dataset. 2. Enable the scientific community to apply advanced machine learning tools to high-impact scientific challenges, such as modeling of materials behavior for clean energy applications. We demonstrate the capabilities of our framework by enabling three new equivariant neural network models for multiple OpenCatalyst tasks and arrive at promising results for compute scaling and model performance.

READ FULL TEXT
research
09/12/2023

MatSciML: A Broad, Multi-Task Benchmark for Solid-State Materials Modeling

We propose MatSci ML, a novel benchmark for modeling MATerials SCIence u...
research
05/05/2023

Materials Informatics: An Algorithmic Design Rule

Materials informatics, data-enabled investigation, is a "fourth paradigm...
research
10/20/2020

The Open Catalyst 2020 (OC20) Dataset and Community Challenges

Catalyst discovery and optimization is key to solving many societal and ...
research
06/28/2021

Cosmic-CoNN: A Cosmic Ray Detection Deep-Learning Framework, Dataset, and Toolkit

Rejecting cosmic rays (CRs) is essential for scientific interpretation o...
research
09/20/2020

Supervised Ontology and Instance Matching with MELT

In this paper, we present MELT-ML, a machine learning extension to the M...
research
12/06/2021

Scalable Geometric Deep Learning on Molecular Graphs

Deep learning in molecular and materials sciences is limited by the lack...
research
09/20/2022

FACT: Learning Governing Abstractions Behind Integer Sequences

Integer sequences are of central importance to the modeling of concepts ...

Please sign up or login with your details

Forgot password? Click here to reset