Catalyst Property Prediction with CatBERTa: Unveiling Feature Exploration Strategies through Large Language Models

09/01/2023
by Janghoon Ock, et al.

Efficient catalyst screening requires predictive models for adsorption energy, a key property of reactivity. However, prevailing methods, notably graph neural networks (GNNs), need precise atomic coordinates to construct graph representations, and integrating observable attributes into them remains challenging. This research introduces CatBERTa, a Transformer model that predicts energy from textual inputs. Built on a pretrained Transformer encoder, CatBERTa processes human-interpretable text that incorporates target features. Attention score analysis reveals that CatBERTa focuses on tokens related to adsorbates, bulk composition, and their interacting atoms. Moreover, interacting atoms emerge as effective descriptors for adsorption configurations, while factors such as bond length and the atomic properties of these atoms contribute little to the predictions. By predicting adsorption energy from the textual representation of initial structures, CatBERTa achieves a mean absolute error (MAE) of 0.75 eV, comparable to vanilla GNNs. Furthermore, subtracting CatBERTa-predicted energies cancels out their systematic errors by as much as 19.3% for chemically similar systems, surpassing the error reduction observed in GNNs. This outcome highlights the model's potential to improve the accuracy of energy difference predictions. This research establishes a fundamental framework for text-based catalyst property prediction that does not rely on graph representations, while also unveiling intricate feature-property relationships.
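
The error cancellation mentioned in the abstract can be illustrated with a minimal sketch. It assumes, purely for illustration, that each prediction equals the true energy plus an error component shared by a pair of chemically similar systems plus a small system-specific residual. All numbers below are invented and are not taken from the paper, and the names (mae, shared_error, and so on) are hypothetical.

```python
from statistics import mean


def mae(pred, true):
    """Mean absolute error between two equal-length sequences (eV)."""
    return mean(abs(p - t) for p, t in zip(pred, true))


# Invented reference energies (eV) for three pairs of similar systems A and B.
true_a = [-1.20, -0.80, -0.45]
true_b = [-1.00, -0.95, -0.30]

# Assumption: both members of a pair share the same systematic error,
# and each also carries a small system-specific residual.
shared_error = [0.50, -0.40, 0.30]
residual_a = [0.05, -0.02, 0.03]
residual_b = [-0.04, 0.01, 0.02]

pred_a = [t + s + r for t, s, r in zip(true_a, shared_error, residual_a)]
pred_b = [t + s + r for t, s, r in zip(true_b, shared_error, residual_b)]

# Error of the individual energy predictions.
mae_a = mae(pred_a, true_a)
mae_b = mae(pred_b, true_b)

# Error of the energy differences: the shared term drops out on subtraction,
# leaving only the difference of the residuals.
diff_pred = [a - b for a, b in zip(pred_a, pred_b)]
diff_true = [a - b for a, b in zip(true_a, true_b)]
mae_diff = mae(diff_pred, diff_true)

print(f"MAE(A) = {mae_a:.3f} eV, MAE(B) = {mae_b:.3f} eV")
print(f"MAE(A - B) = {mae_diff:.3f} eV")
```

In this toy setting the difference error is smaller than either individual error because the shared component cancels exactly. Real predictions would cancel only partially, which is consistent with the partial reduction the abstract reports.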


